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NUCLEIC ACID MOLECULES ENCODING SERINE PROTEASE 16, THE 
ENCODED POLYPEPTIDES AND METHODS BASED THEREON 

RELATED APPLICATIONS 

Benefit of priority under 35 U.S.C. §1 19(e) is claimed to U.S. 
5 provisional application Serial No. 60/394,347, filed July 2, 2002, to 

Edwin L. Madison, Edgar O. Ong and Juinn-Chern Yeh, entitled "NUCLEIC 
ACID MOLECULES ENCODING SERINE PROTEASE 16, THE ENCODED 
PROTEINS AND METHODS BASED THEREON." The subject matter of 
each of the provisional application and International PCT application No. 

10 Docket No. 24745-1 625PC, filed on the same day herewith, entitled "NU- 
CLEIC ACID MOLECULES ENCODING SERINE PROTEASE 16, THE EN- 
CODED POLYPEPTIDES AND METHODS BASED THEREON", is incorpora- 
ted by reference in it entirety. 
FIELD OF INVENTION 

15 Nucleic acid molecules that encode proteases and portions thereof, 

particularly protease domains are provided. Also provided are prognostic, 
diagnostic and therapeutic methods using the proteases and domains 
thereof and the encoding nucleic acid molecules. 
BACKGROUND OF THE INVENTION AND OBJECTS THEREOF 

20 Cancer is a leading cause of death in the United States, developing 

in one in three Americans; one of every four Americans dies of cancer. 
Cancer is characterized by an increase in the number of abnormal 
neoplastic cells, which proliferate to form a tumor mass, the invasion of 
adjacent tissues by these neoplastic tumor cells, and the generation of 

25 malignant cells that metastasize via the blood or lymphatic system to 
regional lymph nodes and to distant sites. 

Among the hallmarks of cancer is a breakdown in the communi- 
cation among tumor cells and their environment. Normal cells do not 
divide in the absence of stimulatory signals, and cease dividing in the 

30 presence of inhibitory signals. Growth-stimulatory and growth-inhibitory 
signals are routinely exchanged between cells within a tissue. In a 
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cancerous, or neoplastic, state, a cell acquires the ability to "override" 
these signals and to proliferate under conditions in which normal cells do 
not grow. 

In order to proliferate, tumor cells acquire a number of distinct 
5 aberrant traits reflecting genetic alterations. The genomes of certain well- 
studied tumors carry several different independently altered genes, 
including activated oncogenes and inactivated tumor suppressor genes. 
Each of these genetic changes appears to be responsible for imparting 
some of the traits that, in the aggregate, represent the full neoplastic 

10 phenotype. 

A variety of biochemical factors have been associated with 
different phases of metastasis. Cell surface receptors for collagen, 
glycoproteins such as laminin, and proteoglycans, facilitate tumor cell 
attachment, an important step in invasion and metastases. Attachment 

15 triggers the release of degradative enzymes which facilitate the 

penetration of tumor cells through tissue barriers. Once the tumor cells 
have entered the target tissue, specific growth factors are required for 
further proliferation. Tumor invasion and progression involves a complex 
series of events, in which tumor cells detach from the primary tumor, 

20 break down the normal tissue surrounding it, and migrate into a blood or 
lymphatic vessel to be carried to a distant site. Destruction and/or 
remodeling of normal tissue barriers is accomplished by the elaboration of 
specific enzymes that degrade the proteins of the extracellular matrix that 
make up basement membranes and stromal components of tissues. 

25 A class of extracellular matrix degrading enzymes has been 

implicated in tumor invasion. Among these are the matrix 
metalloproteinases (MMP). For example, the production of the matrix 
metalloproteinase stromelysin is associated with malignant tumors with 
metastatic potential (see, e.g., McDonnell et al. (1990) Smnrs. in Cancer 

30 Biology 7:1 07-1 1 5; McDonnell et al. (1 990) Cancer and Metastasis 
Reviews 5:309-319). 
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The capacity of cancer cells to metastasize and invade tissue is 
facilitated by degradation of the basement membrane. Several proteinase 
enzymes, including the MMPs, have been reported to facilitate the 
process of invasion of tumor cells. MMPs are reported to enhance 
5 degradation of the basement membrane, which thereby permits tumorous 
cells to invade tissues. For example, two major metalloproteinases 
having molecular weights of about 70 kDa and 92 kDa appear to enhance 
the ability of tumor cells to metastasize. 

Serine Proteases 

10 Serine proteases (SPs) have been implicated in neoplastic disease 

progression. Most serine proteases, which are either secreted enzymes or 
are sequestered in cytoplasmic storage organelles, have roles in blood 
coagulation, wound healing, digestion, immune responses and tumor 
invasion and metastasis. A class of cell surface proteins designated type 

15 II transmembrane serine proteases, which are membrane-anchored 

proteins with additional extracellular domains, has been identified. As cell 
surface proteins, they are positioned to play a role in intracellular signal 
transduction and in mediating cell surface proteolytic events. Other 
serine proteases can be membrane bound and function in a similar 

20 manner. Others are secreted. Many serine proteases exert their activity 
upon binding to cell surface receptors, and, hence act at cell surfaces. 
Cell surface proteolysis is a mechanism for the generation of biologically 
active proteins that mediate a variety of cellular functions. 

Serine proteases, including secreted and transmembrane serine 

25 proteases, have been implicated in processes involved in neoplastic 

development and progression. While the precise role of these proteases 
has not been fully elaborated, serine proteases and inhibitors thereof are 
involved in the control of many intra- and extracellular physiological 
processes, including degradative actions in cancer cell invasion, 

30 metastatic spread, and neovascularization of tumors, that are involved in 
tumor progression. It is believed that proteases are involved in the 
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degradation and remodeling of extracellular matrix (ECM) and contribute 
to tissue remodeling, and are necessary for cancer invasion and 
metastasis. The activity and/or expression of some proteases have been 
shown to correlate with tumor progression and development. 
5 For example, a membrane-type serine protease MTSP1 (also called 

matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; 
and GenBank Accession No. AF1 18224; (1999) J. Biol. Chem. 
274:18231-18236; U.S. Patent No. 5,792,616; see, also Takeuchi 
(1999) Proc. Natl. Acad. Sci. U.S.A. 56:11054-1161) that is expressed in 

10 epithelial cancer and normal tissue (Takeucuhi et al. (1999) Proc. Natl. 
Acad. Sci. USA 96 A 1054-61) has been identified. Matriptase was 
originally identified in human breast cancer cells as a major gelatinase 
(see, U.S. Patent No. 5,482,848), a type of matrix metalloproteinase 
(MMP). It has been proposed that it plays a role in the metastasis of 

15 breast cancer. Matriptase also is expressed in a variety of epithelial 
tissues with high levels of activity and/or expression in the human 
gastrointestinal tract and the prostate. MTSPs, designated MTSP3, 
MTSP4, MTSP6 have been described in published International PCT 
application No. WO 01/57194, based in International PCT application No. 

20 PCT/US01/03471. 

Prostate-specific antigen (PSA), a kallikrein-like serine protease, 
degrades extracellular matrix glycoproteins fibronectin and laminin, and, 
has been postulated to facilitate invasion by prostate cancer cells 
(Webber et al. (1995) Clin. Cancer Res. 7:1089-94). Blocking PSA 

25 proteolytic activity with PSA-specific monoclonal antibodies results in a 
dose-dependent decrease in vitro in the invasion of the reconstituted 
basement membrane Matrigel by LNCaP human prostate carcinoma cells 
which secrete high levels of PSA. 

Hepsin, a cell surface serine protease identified in hepatoma cells, 

30 is overexpressed in ovarian cancer (Tanimoto et al. (1997) Cancer Res. , 
57):2884-7). The hepsin transcript appears to be abundant in carcinoma 
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tissue and is almost never expressed in normal adult tissue, including 
normal ovary. It has been suggested that hepsin is frequently 
overexpressed in ovarian tumors and therefore can be a candidate 
protease in the invasive process and growth capacity of ovarian tumor 
5 cells. 

A serine protease-like gene, designated normal epithelial 
cell-specific 1 (NES1) (Liu et al. Cancer Res. 55:3371-3379 (1996)) has 
been identified. Although expression of the NES1 mRNA is observed in 
all normal and immortalized nontumorigenic epithelial cell lines, the 

10 majority of human breast cancer cell lines show a drastic reduction or a 
complete lack of its expression. The structural similarity of NES1 to 
polypeptides known to regulate growth factor activity and a negative 
correlation of NES1 expression with breast oncogenesis suggest a direct 
or indirect role for this protease-like gene product in the suppression of 

15 tumorigenesis. 

Hence transmembrane and other serine proteases and other 
proteases appear to be involved in the etiology and pathogenesis of 
tumors. There is a need to further elucidate their role in these processes 
and to identify additional transmembrane proteases. Therefore, among 

20 the objects herein, it is an object herein to provide serine protease 

proteins and nucleic acids encoding such proteases that are involved in 
the regulation of or participate in tumorigenesis and/or carcinogenesis. It 
also is an object herein to provide prognostic, diagnostic, therapeutic 
screening methods using the such proteases and the nucleic acids 

25 encoding such proteases. 

SUMMARY OF THE INVENTION 

Provided herein are proteins designated CVSP1 6 and protease 
domains thereof. CVSP16 is a member of the serine protease family and 
is expressed or active in breast, colon, lung, prostate, kidney, stomach, 

30 spleen, thyroid gland, trachea and pituitary gland and in tumor tissues 
and cancers, including colon, breast and prostate cancers, and in 
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leukemias and lymphomas. Hence, as a protease it can be involved in 
tumor progression. By virtue of its functional activity it can be a 
therapeutic or diagnostic target. The expression and/or activation (or 
reduction in level of expression or activation) of the expressed protein 
5 (zymogen) of the this protein can be used to monitor cancer and cancer 
therapy. 

The serine protease family includes members that are activated 
and/or expressed in tumor cells at different levels from non-tumor cells; 
and those from cells in which substrates therefor differ in tumor cells 

10 from non-tumor cells or otherwise alter the specificity of the serine 

protease (SP). The serine protease provided herein, designated herein as 
CVSP16, is a secreted protease. Protease domains and full-length 
protein, including the zymogen or inactive forms and activated forms, and 
uses thereof are also provided. Proteins encoded by splice variants are 

15 also provided. Nucleic acid molecules encoding the proteins and protease 
domains are also provided. 

CVSP16 polypeptide is expressed as a secreted protein and may 
bind to cell surface receptors to function as a cell-surface bound protease, 
such as by binding thereto or by dimerization or multimerization with a 

20 membrane-bound or receptor-bound protein. CVSP16 polypeptides are 
serine proteases and exhibit catalytic activity and also can exhibit 
substrate and ligand binding activity. The CVSP16 proteases provided 
herein do not include a contiguous sequence of at 5 least amino acids 
from SEQ ID No. 21 inserted between residues corresponding to Q 660 and 

25 M 661 and/or the CVSP1 6 proteases contains at least two protease 
domains of a serine protease 16 (CVSP16) and includes at least 5 
contiguous amino acids corresponding to residues 508-544 of SEQ ID No. 
6. or contains the contiguous sequence Asn Asp Ser or Trp Asn Asp or 
Ser Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu 

30 Gin Thr His in the second protease domain. 
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An exemplified CVSP16 (see, e.g., SEQ ID Nos. 5 and 6, which set 
forth nucleic acid and amino acid sequences thereof), includes two 
protease domains, designated PD1 and PD2, respectively (see, e.g., PD1 
amino acids 46 to 286 (which includes the R that is cleaved upon 
5 activation cleavage), and PD2 including amino acids 323 or 324 or 325 or 
326 to 550 of SEQ ID No. 6). The exemplified CVSP16 contains a signal 
peptide sequence (e.g., aa 1 to aa 23 of SEQ ID No. 6) and a first trypsin- 
like serine protease domain designated herein as CVSP16 PD1 
characterized by the presence of a protease activation cleavage site 

10 (...R 46 4 l 47 VGGSNAQP..., where I indicates protease activation cleavage 
site) at the beginning of the domain and the catalytic triad residues (H 87 , 
D 139 and S 243 ) in 3 highly-conserved regions of the catalytic domain. 

In addition CVSP16 has a second protease domain PD2, in which 
the catalytic histidine is replaced by a serine, indicating that the second 

15 protease domain has lower catalytic activity. The isolated protease 

domains as single chains are provided as are polypeptides that include 
such protease domains. In particular, a polypeptide that contains PD1 as 
the only CVSP1 6 portion is provided. Also provided polyepeptides that 
include PD1 and/or PD2, but do not include at least 5 contiguous amino 

20 acids from SEQ ID No. 21 and polypeptides that include PD1 or PD2, 

particularly polypeptides that include PD1 and/or PD2 as the only CVSP16 
portion. Also included are CVSP16 proteases that contain at least two 
protease domains of a serine protease 16 (CVSP1 6) and include at least 5 
contiguous amino acids corresponding to residues 508-544 of SEQ ID No. 

25 6. or contains the contiguous sequence Asn Asp Ser or Trp Asn Asp or 
Ser Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu 
Gin Thr His in the second protease domain. 

Isolated PD1 and/or PD2 are provided as single-chain molecules are 
as activated one, two or three chain molecules. Each of PD1 or PD2 can 

30 exhibit functional activity (catalytic, substrate binding and/or ligand 
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binding activity) without undergoing activation cleavage and/or as single 
chains. 

Nucleic acid encoding the CVSP16 protease and upstream nucleic 
acid is set forth in SEQ ID No. 5; and the encoded protein is set forth in 
5 SEQ ID No. 6. 

The protease domains for use in the methods and assays provided 
herein do not have to result from activation, which produces a two or 
multi-chain activated product, but can be a single-chain polypeptide. 
Such polypeptides, although not the result of activation and not two- 

10 chain forms, can exhibit proteolytic (catalytic) activity. These protease 
domain polypeptides are used in assays to screen for agents that 
modulate an activity of the CVSP1 6. 

Such assays are also provided herein. In exemplary assays, the 
effects of test compounds on the ability of the full length-single chain, 

15 multiple chain activated forms, or a protease domain, which is a single 
chain or a double chain activated form, of CVSP16 to proteolytically 
cleave a known substrate, typically a fluorescently, chromogenically or 
otherwise detectably labeled substrate, are assessed. Agents, generally 
compounds, particularly small molecules, that modulate an activity of the 

20 polypeptide (full length or protease domain(s) either single or double chain 
or multi-chain forms thereof) are candidate compounds for modulating an 
activity of a CVSP16. The protease domains and full length proteins also 
can be used to produce that bind to CVSP16s provided herein as well as 
single-chain protease-specific antibodies and/or multi-chain specific 

25 antibodies. 

The protease domains provided herein include, but are not limited 
to, a single chain region having an N-terminus at a cleavage site for 
activation of a zymogen form, through the C-terminus, or C-terminal 
truncated portions thereof that exhibit proteolytic activity as a single- 
30 chain polypeptide in in vitro proteolysis assays of any CVSP16 provided 
herein, from a mammal, including a human, that, for example, is 
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expressed in tumor cells at different levels from non-tumor cells. Such 
protease domains include single- chains PD1 and PD2 each having an N- 
terminus that corresponds to that resulting from activation cleavage (i.e. 
corresponding to amino acids 47 and 324, respectively of SEQ ID No. 6). 
5 Also provided are truncated versions thereof, particularly, C-terminal 

truncated versions, that exhibit proteolytic and/or substrate binding and/or 
ligand binding activity. 

Also provided are muteins of a single-chain protease domain of 
CVSP16 particularly muteins in which a Cys residue (corresponding to 

10 residue no. 159) in a protease domain that is unpaired (i.e., free; does not 
form disulfide linkages with any other Cys residue within the same 
protease domain) is substituted with another amino acid substitution, 
generally with a conservative amino acid substitution or a substitution 
that does not eliminate an activity, and muteins in which a glycosylation 

15 site(s) is eliminated. Similarly, an unpaired cysteine (corresponding to 
residue no. 430 in SEQ ID No. 6, a residue that does not form disulfide 
linkages with any other Cys residue within the second protease domain) 
similarly can be substituted. Muteins in which other conservative amino 
acid substitutions in which catalytic activity or other functional activity is 

20 retained are also contemplated (see, e.g., Table 1, for exemplary amino 
acid substitutions). 

Hence, provided herein are members of the family of serine 
proteases designated CVSP16, and functional domains, especially 
protease (or catalytic) domains thereof, muteins and other derivatives and 

25 analogs thereof. Also provided herein are nucleic acids encoding the 
CVSP16. 

As noted, the nucleic acid and amino acid sequences of an 
exemplary CVSP16 are set forth in SEQ ID Nos. 5 and 6. Molecules with 
single or a plurality of amino acids insertions, deletions or substitutions 
30 are provides. Nucleic acid molecules that encode a single-chain protease 
domain or catalytically active portion thereof and also those that encode 
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the full-length CVSP16 are provided. Also provided are nucleic acid 
molecules that hybridize to such CVSP16 encoding nucleic acid along at 
least about 70%, 80%, 90%, 95% or more of their full length and 
encode a protease domain or portion thereof. Hybridization is typically 
5 performed under conditions of at least low, generally at least moderate, 
and often high stringency; generally the hybridizing nucleic acid hybridizes 
along at least about 70%, 80%, 90%, 95% of its full length at the 
recited stringency. 

Additionally provided herein are antibodies that specifically bind to 

10 the CVSP16, particularly the CVSP16s provided, including CVSP16s 
provided herein that does not include at least 5, 7, 10, 15, 20 or more 
contiguous amino acids from SEQ ID No. 21 . Also provided are 
antibodies that specifically bind to CVSP16 proteases that contain at least 
two protease domains of a serine protease 16 (CVSP16) and include at 

15 least 5 contiguous amino acids corresponding to residues 508-544 of 
SEQ ID No. 6. or contains the contiguous sequence Asn Asp Ser or Trp 
Asn Asp or Ser Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr 
His or Leu Gin Thr His in the second protease domain. 

Included are antibodies that specifically bind to the protein or 

20 protease domain, including to the single and/or double chain forms 
thereof. Among the antibodies are two-chain-specific antibodies, and 
single-specific antibodies and neutralizing antibodies that inhibit functional 
activity (i.e., catalytic activity and/or substrate or binding activity). Also 
provided are antibodies that bind with at least 2-fold, 5-fold, 10-fold or 

25 100-fold greater affinity to CVSP16 polypeptides that do not include a 
contiguous portion (5, 7, 10, 15, 20 or more amino acid residues) of the 
sequence of amino acids set forth in SEQ ID No. 21 compared to a CVSP 
polypeptide that includes SEQ ID No. 21 . Typically the CVSP16 
polypeptides that do not include the sequence of amino acids set forth in 

30 SEQ ID No. 21 do not include it between residues corresponding to Q 660 
and M 661 (see, SEQ ID No. 6) and/or the CVSP16s polypeptides contain at 
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least two protease domains of a serine protease 16 (CVSP16) and include 
at least 5 contiguous amino acids corresponding to residues 508-544 of 
SEQ ID No. 6. or contains the contiguous sequence Asn Asp Ser or Trp 
Asn Asp or Ser Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr 
5 His or Leu Gin Thr His in the second protease domain.. 

CVSP16 polypeptides, including, but not limited to, splice variants 
thereof, domains, derivatives and analogs thereof are provided herein. 
Single-chain protease domains, where the N-terminal is that which would 
be generated by activation of the zymogen form are provided. Multi- 
10 chain, generally two- or three- chain activated forms of CVSP16 also are 
provided. 

Also provided are cells, combinations, kits and articles of 
manufacture that contain the nucleic acid encoding the CVSP16 and/or 
the CVSP16. Further provided herein are prognostic, diagnostic, 

15 therapeutic screening methods using CVSP16 and the nucleic acids 
encoding CVSP16. Also provided are transgenic non-human animals 
bearing inactivated genes encoding CVSP16 and bearing the genes (or 
inserted cDNA) encoding the CVSP16, particularly under a non-native 
promoter control or on an exogenous element, such as a plasmid or 

20 artificial chromosome, are additionally provided herein. Also provided are 
nucleic acid molecules encoding each of CVSP16 and domains thereof. 

Also provided are plasmids containing any of the nucleic acid 
molecules provided herein. Cells containing the plasmids are also 
provided. Such cells include, but are not limited to, bacterial cells, yeast 

25 cells, fungal cells, plant cells, insect cells and animal cells. 

Also provided is a method of producing CVSP16 by growing the 
above-described cells under conditions whereby the CVSP16 is expressed 
by the cells, and recovering the expressed CVSP16 polypeptide. Methods 
for isolating nucleic acid encoding other CVSP16s are also provided. 

30 Also provided are cells, generally eukaryotic cells, such as 

mammalian cells and yeast cells, in which the CVSP16 polypeptide is 
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expressed by the cells, particularly under conditions in which it is 
secreted. Also provided are cells to which the CVSP16 polypeptide or a 
protease domain thereof is bound. Such cells are used in drug screening 
assays to identify compounds that modulate an activity of the CVSP1 6 
5 polypeptide. These assays include in vitro binding assays, and 

transcription based assays in which signal transduction mediated by the 
bound CVSP16 is assessed. 

The CVSP16 polypeptides (including those that include all or a 
portion of SEQ ID No. 21) are of interest for a variety of reasons. For 

10 example, they appear to be expressed and/or activated at different levels 
in tumor cells from normal cells, or to have functional activity that is 
different in tumor cells from normal cells, such as by an alteration in a 
substrate therefor, or a cofactor. The CVSP16 polypeptides (including 
those that include all or a portion of SEQ ID No. 21) are of interest for a 

15 variety of reasons. For example, they appear to be expressed and/or 
activated at different levels in tumor cells from normal cells, or to have 
functional activity that is different in tumor cells from normal cells, such 
as by an alteration in a substrate therefor, or a cofactor. 

It is shown herein, that CVSP16 is expressed in cervical cancer. It 

20 also may be expressed in colon, breast, stomach, uterine, ovarian, lung 
and prostate tumors and in other tumors as well as in certain normal cells 
and tissues (see e.g., EXAMPLES for an exemplary tissue-specific 
expression profile). The expression and/or activation thereof and/or its 
presence above a predetermined among is a body fluid can be diagnostic 

25 or prognostic of cancer. 

Hence the CVSP16 polypeptides provided herein can serve as 
diagnostic markers for certain tumors, particularly cervical cancers. The 
level of activated CVSP1 6 can be diagnostic of uterine, pancreatic, 
breast, lung, stomach, prostate or colon cancer or leukemia or other 

30 cancer. 
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Also provided herein are methods of modulating an activity of a 
CVSP16 and screening for compounds that modulate, including inhibit, 
antagonize, agonize or otherwise alter an activity of the CVSP16. Of 
particular interest is the proteolytic (catalytic) portion of the protein. 
5 Thus, provided herein are prognostic, diagnostic and therapeutic 

screening methods using the CVSP16 and the nucleic acids encoding 
CVSP16. In particular, the prognostic, diagnostic and therapeutic 
screening methods are used for preventing, treating, or for finding agents 
useful in preventing or treating, tumors or cancers such as uterine, 

10 stomach, lung carcinoma, colon carcinoma and other cancers. 

Methods of diagnosing a disease or disorder characterized by 
detecting an aberrant level of a CVSP16 in a subject is provided. The 
method can be practiced by measuring the level of the DNA, RNA, protein 
or functional activity of the CVSP16. An increase or decrease in the level 

15 of the DNA, RNA, protein or functional activity of the CVSP16, relative to 
the level of the DNA, RNA, protein or functional activity or any other 
suitable control, found in an analogous sample not having the disease or 
disorder (or other suitable control) is indicative of the presence of the 
disease or disorder in the subject. 

20 Also provided are methods for screening for compounds that 

modulate an activity of CVSP1 6. The compounds are identified by 
contacting them with the CVSP16 or protease domain thereof and a 
substrate for the CVSP1 6. A change in the amount of substrate cleaved 
in the presence of the compounds and compared to that in the absence of 

25 the compound indicates that the compound modulates an activity of a 

CVSP16. Such compounds are selected for further analyses or for use to 
modulate the activity of the CVSP16, such as inhibitors or agonists. The 
compounds also can be identified by contacting the substrates with a cell 
that expresses the CVSP1 6 or with the CVSP1 6 or a proteolytically active 

30 portion thereof. 
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Computer based screening methods are also provided. In these 
methods, interactions between simulated test compounds computer 
simulated CVSP16 polypeptides are assessed, such as by computational 
docking or binding studies. Test compounds predicted to bind or 
5 otherwise interact with a CVSP16 polypeptide are selected as drug 

candidates. Further characterization and study, such as in vitro assays, 
can be performed. 

Also provided herein are modulators of the activity of CVSP1 6, 
especially the modulators (i.e., antagonists, agonists, inhibitors), including 

10 antibodies and RNA molecules, and molecules obtained according to the 
screening methods provide herein. Such modulators can have use in 
treating cancerous conditions, and other neoplastic conditions. 

Pharmaceutical compositions containing a protease domain and/or 
full-length or two protease domains or other domain of a CVSP16 

15 polypeptide are provided herein in a pharmaceutical^ acceptable carrier or 
excipient are provided herein. 

Also provided are articles of manufacture that contain CVSP16 
polypeptide and/or a protease domain or protease domains of a CVSP1 6 
as single-chain zymogen and activated forms and as full-length form 

20 zymogen or activated forms and other forms thereof. The articles contain 
a) packaging material; b) a CVSP16 polypeptide (or encoding nucleic 
acid), particularly a polypeptide containing a single chain protease domain 
thereof; and c) a label indicating that the article is for use in assays for 
identifying modulators of the activities of a CVSP1 6 polypeptide or for 

25 use in diagnostic assays. 

Conjugates containing a) a CVSP16 polypeptide or protease domain 
in single chain from; and b) a targeting agent linked to the SP directly or 
via a linker, wherein the agent facilitates: i) affinity isolation or purification 
of the conjugate; ii) attachment of the conjugate to a surface; iii) 

30 detection of the conjugate; or iv) targeted delivery to a selected tissue or 
cell, is provided herein. The conjugate can contain a plurality of agents 
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linked thereto. The conjugate can be a chemical conjugate; and it can be 
a fusion protein. Targeting agents include proteins and peptide 
fragments. The protein or peptide fragment can include a protein binding 
sequence, a nucleic acid binding sequence, a lipid binding sequence, a 
5 polysaccharide binding sequence, or a metal binding sequence. 

Combinations are provided herein. The combination can include: 
a) an inhibitor of an activity of a CVSP16; and b) an anti-cancer treatment 
or agent. The SP inhibitor and the anti-cancer agent can be formulated in 
a single pharmaceutical composition or each is formulated in a separate 

10 pharmaceutical composition. The CVSP16 inhibitor can be an antibody or 
a fragment or binding portion thereof made against the CVSP16, such as 
an antibody that specifically binds to a protease domain, an inhibitor of 
CVSP16 production, or an inhibitor of CVSP16 membrane-localization or 
an inhibitor of CVSP16 activation. Other CVSP16 inhibitors include, but 

15 are not limited to, an antisense nucleic acid encoding the CVSP16, 

particularly a portion of a protease domain, a nucleic acid encoding at 
least a portion of a gene encoding the CVSP16 with a heterologous 
nucleotide sequence inserted therein such that the heterologous sequence 
inactivates the biological activity of the encoded CVSP16 or the gene 

20 encoding it. The portion of the gene encoding the CVSP16 typically will 
flank the heterologous sequence to promote homologous recombination 
with a genomic gene encoding the CVSP16. 

Also, provided are methods for treating or preventing a tumor or 
cancer in a mammal by administering to a mammal an effective amount of 

25 an inhibitor of a CVSP16, whereby the tumor or cancer is treated or 

prevented. The CVSP16 inhibitor used in the treatment or for prophylaxis 
is administered with a pharmaceutical^ acceptable carrier or excipient. 
The mammal treated can be a human. The treatment or prevention 
method can additionally include administering an anti-cancer treatment or 

30 agent simultaneously with or subsequently or before administration of the 
CVSP16 inhibitor. 
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Also provided is a recombinant non-human animal in which an 
endogenous gene of a CVSP16 has been deleted or inactivated by 
homologous recombination or insertional mutagenesis of the animal or an 
ancestor thereof. A recombinant non-human animal is provided herein, 
5 where the gene of a CVSP1 6 is under control of a promoter that is not 
the native promoter of the gene or that is not the native promoter of the 
gene in the non-human animal or where the nucleic acid encoding the 
CVSP16 is heterologous to the non-human animal and the promoter is the 
native or a non-native promoter or the CVSP16 is on an 

10 extrachromosomal element, such as a plasmid or artificial chromosome. 

Also provided are methods of treatments of tumors by adminis- 
tering a prodrug that is activated by CVSP16 that is expressed or active 
in tumor cells, particularly those in which its functional activity in tumor 
cells is greater than in non-tumor cells. The prodrug is administered and, 

15 upon administration, active CVSP16 expressed on cells cleaves the 

prodrug and releases active drug in the vicinity of these cells. The active 
anti-cancer drug accumulates in the vicinity of the tumor. This is 
particularly useful in instances in which CVSP16 is expressed or active in 
greater quantity, higher level or predominantly in tumor cells compared to 

20 other cells. 

Also provided are methods of identifying a compound that binds to 
a form of CVSP16, by contacting a test compound with two or more 
forms; determining to which form or forms the compound binds; and if it 
binds to a form of CVSP16, further determining whether the compound 
25 has at least one of the following properties: 

(i) inhibits activation of a single-chain zymogen form of 

CVSP16; 

(ii) inhibits activity of a form; and/or 

(iii) inhibits dimerization of the protein. 
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The forms can be full length forms zymogen or activated forms, or a 
single or multi-chain protease domain or two protease domains or other 
form resulting from cleavage at an activation site. 

Also provided are methods of diagnosing the presence of a pre- 
5 malignant lesion, a malignancy, or other pathologic condition in a subject, 
by obtaining a biological sample from the subject; exposing it to a 
detectable agent that binds to a multi-chain (i.e., two- or three- chain) or 
single-chain form of CVSP16, where the pathological condition is 
characterized by the presence or absence or amount of a three-chain, 

10 two-chain or single-chain form . 

Methods of inhibiting tumor invasion or metastasis or treating a 
malignant or pre-malignant condition by administering an agent that 
inhibits activation of a zymogen form of CVSP1 6 or an activity of an 
activated form are provided. The conditions include, but are not limited 

15 to, a condition, such as a tumor of the uterus, stomach and also the 
breast, cervix, prostate, esophagus, lung, ovary and colon. 

Methods for monitoring tumor progression and/or therapeutic 
effectiveness are also provided. The levels of activation or expression of 
CVSP16 or a protease domain or domains thereof are assessed, and the 

20 change in the level, reflects tumor progression and/or the effectiveness of 
therapy. Generally, as the tumor progresses the amount of CVSP16 in a 
body tissue or fluid sample increases; effective therapy reduces the level. 
DETAILED DESCRIPTION 
A. DEFINITIONS 

25 Unless defined otherwise, all technical and scientific terms used 

herein have the same meaning as is commonly understood by one of skill 
in the art to which the invention(s) belong. All patents, patent 
applications, published applications and publications, Genbank sequences, 
databases, websites and other published materials referred to throughout 

30 the entire disclosure herein, unless noted otherwise, are incorporated by 
reference in their entirety. In the event that there are a plurality of 
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definitions for terms herein, those in this section prevail. Where 
reference is made to a URL or other such identifier or address, it 
understood that such identifiers can change and particular information on 
the internet can come and go, but equivalent information can be found by 
5 searching the internet. Reference thereto evidences the availability and 
public dissemination of such information. 

As used herein, the abbreviations for any protective groups, amino 
acids and other compounds, are, unless indicated otherwise, in accord 
with their common usage, recognized abbreviations, or the IUPAC-IUB 
10 Commission on Biochemical Nomenclature (see, (1972) Biochem. 
7 7:942-944). 

As used herein, protease refers to an enzyme catalyzing hydrolysis 
of proteins or peptides. It includes zymogen forms and activated single-, 
two- and three-chain forms thereof. For clarity reference to protease 

15 refers to all forms, and particular forms will be specifically designated. 
As used herein, serine protease refers to a diverse family of 
proteases wherein a serine residue is involved in the hydrolysis of proteins 
or peptides. The serine residue can be part of the catalytic triad 
mechanism, which includes a serine, a histidine and an aspartic acid in 

20 the catalysis, or be part of the hydroxyl/e-amine or hydroxyl/a-amine 
catalytic dyad mechanism, which involves a serine and a lysine in the 
catalysis. Of particular interest are SPs of mammalian, including human, 
origin. Those of skill in this art recognize that, in general, single amino 
acid substitutions in non-essential regions of a polypeptide do not 

25 substantially alter biological activity (see, e.g., Watson eta/. (1987) 

Molecular Biology of the Gene, 4th Edition, The Benjamin/Cummings Pub. 
co., p. 224). 

As used herein, "transmembrane serine protease (MTSP)" refers to 
a family of transmembrane serine proteases that share common structural 
30 features as described herein (see, also Hooper et al. (2001) J. Biol. 
Chem. 276:857-860). Thus, reference, for example, to "MTSP" 
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encompasses all proteins encoded by the MTSP gene family, including but 
are not limited to: MTSP3, MTSP4, MTSP6, MTSP7, MTSP9, MTSP10, 
MTSP20 or an equivalent molecule obtained from any other source or that 
has been prepared synthetically or that exhibits the same activity. Other 
5 MTSPs include, but are not limited to, corin, enteropeptidase, human 
airway trypsin-like protease (HAT), MTSP1, TMPRSS2 and TMPRSS4. 
Sequences of encoding nucleic acid molecules and the encoded amino 
acid sequences of exemplary MTSPs and/or domains thereof are set forth, 
for example in U.S. application Serial No. 09/776,191 (SEQ ID Nos. 1-12, 

10 49, 50 and 61-72 therein, published as International PCT application No. 
WO 01/57194; see also published International PCT application Nos. 
WO 02/072786 and WO 02/977267, and International PCT application 
Nos. PCT/US02/21208 and PCT/US02/1 5332). The term also encompass 
MTSPs with amino acid substitutions that do not substantially alter 

15 activity of each member and also encompasses splice variants thereof. 
Suitable substitutions, including, although not necessarily, conservative 
substitutions of amino acids, are known to those of skill in this art and 
can be made without eliminating a biological activity, such as the 
catalytic activity, of the resulting molecule. 

20 As used herein, Type I MTSP refers to transmembrane proteins 

made with an N-terminal signal peptide that is cleaved so that the new N- 
terminus is on the extracytoplasmic side of the membrane. The original 
N-terminus likely stays on the cytoplasmic side, and cleavage occurs on 
the other side of the membrane. These proteins are anchored through a 

25 C-terminal membrane-spanning segment. 

As used herein, Type II MTSP refers to transmembrane proteins 
that are synthesized with N-terminal or internal signal peptides that are 
not cleaved and that serve as a membrane anchor. 

As used herein, a "protease domain of a CVSP", particularly 

30 CVSP16, refers to a domain of a SP that exhibits proteolytic activity and 
shares homology and structural features with the chymotrypsin/trypsin 
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family protease domains. Hence it is at least the minimal portion of the 
domain that exhibits proteolytic activity (or other functional activity, such 
as ligand or substrate binding) as assessed by standard in vitro assays. 
Those of skill in this art recognize that a protease domain is the portion of 
5 the protease that is structurally equivalent to the trypsin or chymotrypsin 
fold. Contemplated herein are polypeptides that include such protease 
domains and catalytically active portions thereof. Also provided are 
truncated forms of a protease domain that include the smallest fragment 
thereof that acts catalytically as a single-chain form. 

10 Thus, as used herein a protease domain of a CVSP16, whenever 

referenced herein, includes at least one or all of or any combination of or 
a catalytically active portion of a CVSP16 polypeptide as defined herein. 
A protease domain of a CVSP1 6 protein refers to a protease domain of a 
CVSP16 that exhibits serine proteolytic activity. Hence it is at least a 

15 minimal portion of the protein that exhibits proteolytic activity (or binds to 
a substrate or ligand) as a single-chain form or as an activated form. as 
assessed by standard assays in vitro. It refers, herein, to a single chain 
form and also multi-chain activated forms. Exemplary protease domains 
include at least a sufficient portion of a sequence of amino acids set forth 

20 in SEQ ID No. 6 (encoded by nucleotides in SEQ ID No. 5) to exhibit 
protease activity. 

Protease domains of CVSPs vary in size and constitution, including 
insertions and deletions in surface loops. They retain conserved 
structure, including at least one of the active site triad, primary 

25 specificity pocket, oxyanion hole and/or other features of serine protease 
domains of proteases. For purposes herein, a protease domain is a 
single chain portion of a CVSP16, as defined herein, but is homologous in 
its structural features and retention of sequence of similarity or homology 
to the protease domain of chymotrypsin or trypsin. Smaller portions of 

30 the protease domains, particularly the single chain domains, thereof that 
retain protease activity are contemplated. Such smaller versions will 
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generally be C-terminal truncated versions of the protease domains. 
Multi-chain, typically two-chain forms, resulting from activation cleavage 
are also contemplated. The polypeptide can exhibit proteolytic activity as 
a single chain. 

5 Thus, for purposes herein, a protease domain is a portion of a 

CVSP16, as defined herein, and shares common features and homologies 
with protease domains of other SPs. As with the larger class of enzymes 
of the chymotrypsin (S1) fold (see, e.g., Internet accessible MEROPS 
database), the CVSP16 protease domains share a high degree of amino 

10 acid sequence identity. The His, Asp and Ser residues necessary for 
activity are present in conserved motifs in PD1 and in modified form in 
PD2. An activation site, whose cleavage creates the N-terminus of the 
protease domain in two-chain forms has a conserved motif and location 
relative to other sites in the domain and can be identified. Domains that 

15 include some of these conserved features, such as the His, but that 

exhibit any proteolytic activity are considered protease domains herein. 
An activation site, which results in the N-terminus of a second chain in 
the multi-chain forms has a conserved motif and/or location within the 
polypeptide. 

20 Exemplary of a protease domain of a CVSP16 is a polypeptide set 

forth as amino acids 46 to 286 or including amino acids 326 to 550, 
particularly 323-550, of SEQ ID No. 6, which can provided as a single- 
chain isolated molecule or as molecule that contains both domains but not 
the full-length CVSP16. The single-chain form can be a zymogen or 

25 activated form in which the N-terminus corresponds to the N-terminus 
produced by activation cleavage 

As used herein, the catalytically active domain of CVSP16 refers to 
the protease domain. Reference to a protease domain of a CVSP16 
includes the single, two- and multi-chain forms of any of these proteins. 

30 A zymogen form of each protein is an inactive single chain form, which 
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can be converted to an active multi-chain form by cleavage. A protease 
domain also can be converted to a multi-chain form. 

Significantly, at least in vitro, the single chain forms of the SPs and 
catalytic domains or proteolytically active portions thereof (typically C- 
5 terminal truncations) thereof exhibit protease activity. Hence provided 
herein are isolated single chain forms of protease domains of SPs and 
their use in in vitro drug screening assays for identification of agents that 
modulate an activity thereof. 

For the protease domains, residues at the N-terminus can be critical 

10 for activity, since it has been shown that the free amino group at the N- 
terminus of such proteases is essential for formation of the catalytically 
active conformation upon activation cleavage of the zymogen form of the 
protease. A protease domain of the single chain form of the CVSP1 6 
protease can be catalytically active. Hence a protease domain generally 

15 requires N-terminal amino acids; the C-terminal portion can be truncated. 
The amount that can be removed can be determined empirically by testing 
the protein for protease activity in an in vitro assay that assesses catalytic 
cleavage. 

Thus, as used herein, the catalytically active domain of a CVSP 
20 refers to a protease domain. Typically this refers to a single-chain form of 
a protease domain. Multi-chain, particularly two chain forms of a 
protease domain also are contemplated; such domains are necessarily 
longer since they include one or more amino acids on the N-terminus side 
of the activation cleavage site as a second chain. A zymogen form of 
25 each protease domain and the protease is a single chain, which is 

converted to an activated, typically a multi-chain form, such as a two or a 
three chain form or an active two or three- chain form by activation 
cleavage of one or both protease domains. Single-chain forms with the 
N-terminus that corresponds to the activation cleavage site can be active 
30 in vitro and are provided. 
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As used herein, by active form is meant a form active in vivo 
and/or in vitro. Single chain forms of the SPs and the catalytic domains 
or proteolytically active portions thereof (typically C -terminal truncations) 
exhibit protease activity. For example, a polypeptide containing the 
5 protease domain can exist as an activated three-chain, two-chain or a 
single chain active form. The active single chain, two chain and three- 
chain forms of a CVSP16 and catalytic domains or proteolytically active 
portions thereof can exhibit protease activity. Among the polypeptides 
provided herein, are isolated single-chain forms, two-chain and three- 

10 chain forms of CVSP1 6 polypeptides that include protease domains and 
their use, for example, in in vitro drug screening assays for identification 
of agents that modulate the activity thereof. 

There are two types of single-chain forms; active and zymogen 
forms. When cleaved at the activation site, there only one single chain 

15 active forms. Any form that results from activation cleavage or that has 
an N-terminus at a site that is the site of cleavage is an active single- 
chain forms. The N-terminus of an activated form must result from 
activation cleavage and be at that site. Any other single-chain form, 
which has a A different N-terminus is a zymogen. 

20 As used herein, a CVSP1 6 whenever referenced herein, includes at 

least one or all of or any combination of: 

a) a polypeptide encoded by the sequence of nucleotides set 
forth in SEQ ID No. 5; 

b) a polypeptide encoded by a sequence of nucleotides that 
25 hybridizes under conditions of low, moderate or high stringency to the 

sequence of nucleotides set forth in SEQ ID No. 5, wherein the encoded 
polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 
from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
30 include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6; 
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c) a polypeptide that contains the sequence of amino acids 
set forth in SEQ ID No. 6; 

d) a polyeptide that contains at least two protease domains 
of a serine protease 1 6 (CVSP1 6) and includes at least 5 contiguous 

5 amino acids corresponding to residues 508-544 of SEQ ID No. 6. or 
contains the contiguous sequence Asn Asp Ser or Trp Asn Asp or Ser 
Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu Gin 
Thr His in the second protease domain; 

e) a polypeptide that contains a sequence of amino acids 
10 having at least about 60%, 70%, 80%, 90% or about 95% sequence 

identity with the sequence of amino acids set forth in SEQ ID No. 6, 
wherein the polypeptide does not include at least 5 or at least 6 or at 
least 7 or at least 8 contiguous or at least 9 or at least 10 contiguous 
amino acids from SEQ ID No. 21 up to all of SEQ ID No. 21, and 
15 particularly does not include any amino acids therefrom between amino 
acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6; 

f) the polypeptide is encoded by a sequence of 
nucleotides that hybridizes under conditions of at least moderate, and can 
be high, stringency along at least 70% of its full length to a sequence of 

20 nucleotides than encodes a polypeptide of any of a)-e), wherein the 
polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 
from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
include any amino acids therefrom between amino acids corresponding to 

25 Gln 660 and Met 661 of SEQ ID No. 6; 

g) the polypeptide has at least 60%, 70%, 80%, 90% or 
about 95% sequence identity with a polypeptide of any of a)-f), wherein 
the polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 

30 from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
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include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6; and/or 

h) a polypeptide encoded by a splice variant of a sequence of 
nucleotides that encodes a CVS P 1-6 polypeptide provided herein. Smaller 
5 portions thereof that retain protease activity are contemplated. 

An exemplary CVSP16 polypeptide includes the sequence of amino 
acids set forth in SEQ ID No. 6; the protease domains are included in the 
sequence of amino acids set forth in SEQ ID No. 6. Smaller portions 
thereof that retain protease activity are contemplated. 
10 Included among the CVSP16 polypeptides, as provided herein are 

those that include a protease domain of serine protease 16 (CVSP16) or a 
catalytically active portion thereof, where: 

the polypeptide does not include at least 5 or at least 6 or at least 
7 or at least 8 contiguous or at least 9 or at least 10 contiguous amino 
15 acids from SEQ ID No. 21 , and particularly do not include any amino 

acids therefrom between amino acids corresponding to Gln 660 and Met 661 
of SEQ ID No. 6; 

the polypeptide contains one, two or three chains; and 
a protease domain contains amino acids 46-286 or 326-550 of 
20 SEQ ID No. 6 or amino acids that share at least about 60%, 70%, 80%, 
90% or 95% homology to amino acids 46-286 or 326-550 of SEQ ID 
No. 6. Exemplary of the protease domains are PD1 (aa 46 to aa 286 of 
SEQ ID No. 6); and PD2 (aa 323 to aa 550 SEQ ID No. 6). Any 
polypeptide provided herein that includes PD1 and PD2 of a CVSP16 
25 does not include at least 5, 6, 10, 15 or 20 contiguous amino acids from 
SEQ ID No. 21, and particularly do not include any amino acids therefrom 
between amino acids (if present) corresponding to Gln 660 and Met 661 of 
SEQ ID No. 6. 

The CVSP16 can be from any animal, particularly a mammal, and 
30 includes but are not limited to, primates including humans, gorillas and 
monkeys; rodents, such as mice and rats; fowl, such as chickens; 
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ruminants, such as goats, cows, deer, sheep; ovine, such as pigs and 
other animals. The full length zymogen or a multi-chain activated form is 
contemplated as is any domain thereof, including a protease domain, 
which can be a three-chain activated form, two-chain activated form, or a 
5 single chain activated or zymogen form. An exemplary CVSP16 protein 
includes a sequence of amino acids set forth in SEQ ID No. 6 that 
includes a protease domain or a catalytically active portion thereof. 

CVSP16s of interest include those that are activated and/or ex- 
pressed in tumor cells at a level or amount different, typically higher, from 

10 those in non-tumor cells; and those from cells in which substrates 

therefor differ in tumor cells from non-tumor cells or differ with respect to 
the substrates, co-factors or receptors, or otherwise alter the activity or 
specificity of the MTSP. 

By active form is meant a form that is functionally active in vivo 

15 and/or in vitro. Included among these are single-chain forms of a 

protease domain (i.e., those with an N-terminus corresponding to the 
normal activation cleavage site) that are active in vitro, including in 
assays for protease activity and/or substrate or ligand binding, and are 
used for screening assays, and also can be used as immunogens for 

20 preparing antibodies. 

As used herein, a CVSP16 and/or a protease domain(s) thereof also 
can exist as a two-chain or other multi-chain (i.e., two or three chains) 
activated form. At least in vitro, unactivated and active single chain 
forms of the SPs and other unactivated forms and the catalytic domains 

25 or proteolytically or functionally active portions thereof (typically C- 
terminal truncations) exhibit protease or other functional activity (i.e., 
catalytic activity and/or other activity, such as substrate or ligand 
binding). Hence provided herein are isolated single chain forms of 
protease domains of SPs, and also multi-chain forms, and their use in in 

30 vitro drug screening assays for identification of agents that modulate the 
activity thereof. 
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As used herein, a functionally active CVSP1 6 polypeptide or 
portion thereof exhibits at least one of catalytic activity, substrate binding 
activity or ligand binding activity. As described, for example, PD1 and 
PD2 exhibit catalytic (protease) activity. The CVSP16 polypeptide can be 
5 active in vitro or in vivo. 

Also contemplated are nucleic acid molecules that encode a 
polypeptide that has proteolytic activity in an in vitro proteolysis assay 
and that have at least 80%, 85%, 90% or 95% sequence identity with 
the full length of a protease domain of a CVSP1 6 polypeptide as provided 

10 herein, or that hybridize along at least about 70%, 80%, 90% and 95% 
of their full length to a nucleic acids that encode a protease domain, 
particularly under conditions of moderate, generally high, stringency. 

As used herein, a zymogen form of a full-length CVSP16 poly- 
peptide or a truncated CVSP16 polypeptide containing one or a plurality 

15 of protease domains is one in which at least one of the domains has not 
undergone activation cleavage. A zymogen form refers to a form in 
which at least one protease domain remains unactivated. 

As used herein, activation cleavage refers to the cleavage of the 
protease at the N-terminus of the protease domain (generally between an 

20 R and I or an R and a V or elsewhere as for PD2 herein). By virtue of the 
Cys-Cys pairing between a Cys outside the protease domain and a Cys in 
the protease domain upon cleavage the resulting polypeptide has at least 
two chains. Cleavage can be effected by another protease or 
autocatalytically. In the exemplified CVSP16, the following cysteine 

25 pairings are noted: C 7 2~C 8 3, C 173 -C 2 49, C 2 o 6 -C 228 , C 23 9~C 267 / C 348 -C 36 4, C444- 
C516/ C 472 -C 494 and C 506 -C 534 . In addition, an unpaired cysteine (C 159 ) in 
the first protease domain should pair with C 38 (by unpaired is meant 
unpaired with a Cys in the particular domain). An unpaired cysteine (C 430 ) 
in the second protease domain should pair with C 325 or C 318 . As a result 

30 the protease, upon activation cleavage can contain multiple chains, 
including two or three or more chains, formed by virtue of pairing 
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between the unpaired cysteine in a protease domain with a cysteine 
outside a protease domain. 

As used herein, a two-chain form of a protease domain refers to a 
two-chain form that is formed from a single chain-form following 
5 activation cleavage or other cleavage of the protease. In such forms the 
Cys pairing between, in this instance, a Cys outside a protease domain 
and an upaired Cys in PD1 or PD2, links a protease domain to the 
remainder of the polypeptide and the activation cleavage cleaves the 
chain. A two-chain protease domain form refers to any form in which the 

10 "remainder of the polypeptide" is shortened or cleaved from the full- 
length and includes a Cys from outside a protease domain. The three 
chain form refers to form in which two of the protease domains are 
activated in this manner. 

As used herein, a form of a CVSP16 is one or more of a single 

15 chain form, a two-chain form, a three chain form and/or other multi-chain 
form and the form is activated or is a zymogen or includes one or more 
activated domains. 

As used herein, a human protein is one encoded by nucleic acid, 
such as DNA, present in the genome of a human, including all allelic 

20 variants and conservative variations as long as they are not variants 
found in other mammals. 

As used herein, a "nucleic acid encoding a protease domain or 
catalytically active portion of a SP" refers to a nucleic acid encoding only 
the recited single chain protease domain or active portion thereof, and not 

25 the other contiguous portions of the SP as a continuous sequence. 

As used herein, catalytic activity refers to the activity of the SP as 
a serine protease. Function of the SP refers to its function in tumor 
biology, including promotion of or involvement in initiation, growth or 
progression of tumors, and also roles in signal transduction. Catalytic 

30 activity refers to the activity of the SP as a protease as assessed in in 
vitro proteolytic assays that detect proteolysis of a selected substrate. 
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As used herein, a CUB domain is a motif homologous to domains 
that mediate protein-protein interactions in complement components 
C1r/C1s. CUB domains have been identified in other proteases and 
various proteins involved in developmental processes. 
5 As used herein, a "propeptide" or "pro sequence" is sequence of 

amino acids positioned at the amino terminus of a mature biologically 
active polypeptide. When so-positioned, the resulting polypeptide is 
called a zymogen. Zymogens, generally, are inactive and can be 
converted to mature active polypeptides by catalytic or autocatalytic 
10 cleavage of the propeptide from the zymogen. 

As used herein, a zymogen is an inactive precursor of a proteolytic 
enzyme. Such precursors are generally larger, although not necessarily 
larger than the active form. With reference to serine proteases, 
zymogens are converted to active enzymes by specific cleavage, including 
15 catalytic and autocatalytic cleavage, or by binding of an activating co- 
factor, which generates an active enzyme. A zymogen, thus, is an 
enzymatically inactive protein that is converted to a proteolytic enzyme by 
the action of an activator. Cleavage can be effected autocatalytically. 

As used herein, "disease or disorder" refers to a pathological 
20 condition in an organism resulting from, e.g., infection or genetic defect, 
and characterized by identifiable symptoms. 

As used herein, neoplasm (neoplasia) refers to abnormal new 
growth, and thus means the same as tumor, which can be benign or 
malignant. Unlike hyperplasia, neoplastic proliferation persists even in the 
25 absence of the original stimulus. 

As used herein, neoplastic disease refers to any disorder involving 
cancer, including tumor development, growth, metastasis and 
progression. 

As used herein, cancer is a general term for diseases caused by or 
30 characterized by any type of malignant tumor. 
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As used herein, malignant, as applies to tumors, refers to primary 
tumors that have the capacity of metastasis with loss of growth control 
and positional control. 

As used herein, an anti-cancer agent (used interchangeably with 
5 "anti-tumor or anti-neoplastic agent") refers to any agents used in anti- 
cancer treatment. These include any agents, when used alone or in 
combination with other compounds, that can alleviate, reduce, ameliorate, 
prevent, or place or maintain in a state of remission of clinical symptoms 
or diagnostic markers associated with neoplastic disease, tumors and 

10 cancer, and can be used in methods, combinations and compositions 

provided herein. Non-limiting examples of anti-neoplastic agents include 
anti-angiogenic agents, alkylating agents, antimetabolite, certain natural 
products, platinum coordination complexes, anthracenediones, substituted 
ureas, methylhydrazine derivatives, adrenocortical suppressants, certain 

15 hormones, antagonists and anti-cancer polysaccharides. 

As used herein, a splice variant refers to a variant produced by 
differential processing of a primary transcript of genomic nucleic acid, 
such as DNA, that results in more than one type of mRNA. Polypeptides 
encoded by splice variants of the disclosed nucleic acids encoding 

20 CVSP16 polypeptides are provided herein. 

As used herein, angiogenesis is intended to broadly encompass the 
totality of processes directly or indirectly involved in the establishment 
and maintenance of new vasculature (neovascularization), including, but 
not limited to, neovascularization associated with tumors. 

25 As used herein, anti-angiogenic treatment or agent refers to any 

therapeutic regimen and compound, when used alone or in combination 
with other treatment or compounds, that can alleviate, reduce, 
ameliorate, prevent, or place or maintain in a state of remission of clinical 
symptoms or diagnostic markers associated with undesired and/or 

30 uncontrolled angiogenesis. Thus, for purposes herein an anti-angiogenic 
agent refers to an agent that inhibits the establishment or maintenance of 
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vasculature. Such agents include, but are not limited to, anti-tumor 
agents, and agents for treatments of other disorders associated with 
undesirable angiogenesis, such as diabetic retinopathies, restenosis, 
hyperproHferative disorders and others. 
5 As used herein, non-anti-angiogenic anti-tumor agents refer to anti- 

tumor agents that do not act primarily by inhibiting angiogenesis. 

As used herein, pro-angiogenic agents are agents that promote the 
establishment or maintenance of the vasculature. Such agents include 
agents for treating cardiovascular disorders, including heart attacks and 
10 strokes. 

As used herein, undesired and/or uncontrolled angiogenesis refers 
to pathological angiogenesis wherein the influence of angiogenesis 
stimulators outweighs the influence of angiogenesis inhibitors. As used 
herein, deficient angiogenesis refers to pathological angiogenesis 
15 associated with disorders where there is a defect in normal angiogenesis 
resulting in aberrant angiogenesis or an absence or substantial reduction 
in angiogenesis. 

As used herein, by homologous means about greater than 25% 
nucleic acid sequence identity, such as 25%, 40%, 60%, 70%, 80%, 

20 90% or 95%. If necessary the percentage homology will be specified. 
The terms "homology" and "identity" are often used interchangeably but 
homology for proteins can include conservative amino acid changes. In 
general, sequences (protein or nucleic acid) are aligned so that the 
highest order match is obtained (see, e.g. : Computational Molecular 

25 Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, 
Part I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 
1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 

30 Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, 1991; Carillo et al. (1988) SIAM J 



-31- 



Applied Math -48:1073). By sequence identity, the number of identical 
amino acids is determined by standard alignment algorithm programs, and 
used with default gap penalties established by each supplier. 
Substantially homologous nucleic acid molecules would hybridize typically 
5 at moderate stringency or at high stringency all along the length of the 
nucleic acid or along at least about 70%, 80% or 90% of the full length 
nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the 
hybridizing nucleic acid molecule. (For proteins, for determination of 
10 homology conservative amino acids can be aligned as well as identical 

amino acids; in this case percentage of identity and percentage homology 
vary). 

Whether any two nucleic acid molecules have nucleotide sequences 
that are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 

15 "identical" can be determined using known computer algorithms such as 
the "FAST A" program, using for example, the default parameters as in 
Pearson et al. (1988) Proc. Natl. Acad. Sci. USA 55:2444 (other 
programs include the GCG program package (Devereux, J., et al., Nucleic 
Acids Research 12f/):387 (1984)), BLASTP, BLASTN, FASTA (Atschul, 

20 S.F., et al., J Molec Biol 275:403 (1 990); Guide to Huge Computers, 

Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo et al. 
( 1 988) SI AM J Applied Math 48: 1 073) . For example, the BLAST function 
of the National Center for Biotechnology Information database can be 
used to determine identity. Other commercially or publicly available 

25 programs include, DNAStar "MegAlign" program (Madison, Wl) and the 
University of Wisconsin Genetics Computer Group (UWG) "Gap" program 
(Madison Wl)). Percent homology or identity of proteins and/or nucleic 
acid molecules can be determined, for example, by comparing sequence 
information using a GAP computer program (e.g., Needleman et al. 

30 (1970) J. Mol. Biol. 45:443, as revised by Smith and Waterman ((1981) 
Adv. Appl. Math. 2:482). Briefly, a GAP program defines similarity as the 
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number of aligned symbols (i.e., nucleotides or amino acids) which are 
similar, divided by the total number of symbols in the shorter of the two 
sequences. Default parameters for the GAP program can include: (1) 
a unary comparison matrix (containing a value of 1 for identities and 0 for 
5 non-identities) and the weighted comparison matrix of Gribskov et a/. 

(1986) Nucl. Acids Res. 14:6745, as described by Schwartz and Dayhoff, 
eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National Bio- 
medical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 
for each gap and an additional 0.10 penalty for each symbol in each gap; 

10 and (3) no penalty for end gaps. Therefore, as used herein, the term 
"identity" represents a comparison between a test and a reference 
polypeptide or polynucleotide. 

As used herein, recitation that amino acids of a polypeptide 
correspond to amino acids in a disclosed sequence, such as amino acids 

15 Q660 and M661 of SEQ ID No. 6, refers to amino acids identified upon 
alignment of the polypeptide with the disclosed sequence to maximize 
identity or homology (where conserved amino acids are aligned) using a 
standard alignment algorithm, such as the GAP algorithm. 

As used herein, the term "at least 90% identical to" refers to 

20 percent identities from 90 to 100% relative to the reference polypeptides. 
Identity at a level of 90% or more is indicative of the fact that, assuming 
for exemplification purposes a test and reference polynucleotide length of 
100 amino acids are compared, no more than 10% (i.e., 10 out of 100) 
of amino acids in the test polypeptide differs from that of the reference 

25 polypeptides. Similar comparisons can be made between a test and 

reference polynucleotides. Such differences can be represented as point 
mutations randomly distributed over the entire length of an amino acid 
sequence or they can be clustered in one or more locations of varying 
length up to the maximum allowable, e.g., 10/100 amino acid difference 

30 (approximately 90% identity). Differences are defined as nucleic acid or 
amino acid substitutions, insertions or deletions. At the level of 
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homologies or identities above about 85-90%, the result should be 
independent of the program and gap parameters set; such high levels of 
identity can be assessed readily, often without relying on software. 

As used herein, primer refers to an oligonucleotide containing two 
5 or more deoxyribonucleotides or ribonucleotides, typically more than 
three, from which synthesis of a primer extension product can be 
initiated. Experimental conditions conducive to synthesis include the 
presence of nucleoside triphosphates and an agent for polymerization and 
extension, such as DNA polymerase, and a suitable buffer, temperature 
10 and pH. 

As used herein, animal includes any animal, such as, but are not 
limited to primates including humans, gorillas and monkeys; rodents, such 
as mice and rats; fowl, such as chickens; ruminants, such as goats, 
cows, deer, sheep; ovine, such as pigs and other animals. Non-human 

15 animals exclude humans as the contemplated animal. The SPs provided 
herein are from any source, animal, plant, prokaryotic and fungal. Most 
CVSP16s are of animal origin, including mammalian origin. 

As used herein, genetic therapy or gene therapy involves the 
transfer of heterologous nucleic acid, such as DNA, into certain cells, 

20 target cells, of a mammal, particularly a human, with a disorder or 

conditions for which such therapy is sought. The nucleic acid, such as 
DNA, is introduced into the selected target cells, such as directly or in a 
vector or other delivery vehicle, in a manner such that the heterologous 
nucleic acid, such as DNA, is expressed and a therapeutic product 

25 encoded thereby is produced. Alternatively, the heterologous nucleic 

acid, such as DNA, can in some manner mediate expression of DNA that 
encodes the therapeutic product, or it can encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, 
expression of a therapeutic product. Genetic therapy also can be used to 

30 deliver nucleic acid encoding a gene product that replaces a defective 

gene or supplements a gene product produced by the mammal or the cell 
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in which it is introduced. The introduced nucleic acid can encode a 
therapeutic compound, such as a growth factor inhibitor thereof, or a 
tumor necrosis factor or inhibitor thereof, such as a receptor therefor, that 
is not normally produced in the mammalian host or that is not produced in 
5 therapeutically effective amounts or at a therapeutically useful time. The 
heterologous nucleic acid, such as DNA, encoding the therapeutic product 
can be modified prior to introduction into the cells of the afflicted host in 
order to enhance or otherwise alter the product or expression thereof. 
Genetic therapy also can involve delivery of an inhibitor or repressor or 

10 other modulator of gene expression. 

As used herein, heterologous nucleic acid is nucleic acid that is not 
normally produced in vivo by the cell in which it is expressed or that is 
produced by the cell but is at a different locus or expressed differently or 
that mediates or encodes mediators that alter expression of endogenous 

15 nucleic acid, such as DNA, by affecting transcription, translation, or other 
regulatable biochemical processes. Heterologous nucleic acid is generally 
not endogenous to the cell into which it is introduced, but has been 
obtained from another cell or prepared synthetically. Heterologous 
nucleic acid can be endogenous, but is nucleic acid that is expressed from 

20 a different locus or altered in its expression. Generally, although not 
necessarily, such nucleic acid encodes RNA and proteins that are not 
normally produced by the cell or in the same way in the cell in which it is 
expressed. Heterologous nucleic acid, such as DNA, also can be referred 
to as foreign nucleic acid, such as DNA. Thus, heterologous nucleic acid 

25 or foreign nucleic acid includes a nucleic acid molecule not present in the 
exact orientation or position as the counterpart nucleic acid molecule, 
such as DNA, is found in a genome. It also can refer to a nucleic acid 
molecule from another organism or species (i.e. , exogenous). 

Any nucleic acid, such as DNA, that one of skill in the art would 

30 recognize or consider as heterologous or foreign to the cell in which the 
nucleic acid is expressed is herein encompassed by heterologous nucleic 
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acid; heterologous nucleic acid includes exogenously added nucleic acid 
that also is expressed endogenously. Examples of heterologous nucleic 
acid include, but are not limited to, nucleic acid that encodes traceable 
marker proteins, such as a protein that confers drug resistance, nucleic 
5 acid that encodes therapeutically effective substances, such as anti- 
cancer agents, enzymes and hormones, and nucleic acid, such as DNA, 
that encodes other types of proteins, such as antibodies. Antibodies that 
are encoded by heterologous nucleic acid can be secreted or expressed 
on the surface of the cell in which the heterologous nucleic acid has been 
10 introduced. 

As used herein, a therapeutically effective product for gene therapy 
is a product that is encoded by heterologous nucleic acid, typically DNA, 
that, upon introduction of the nucleic acid into a host, a product is 
expressed that ameliorates or eliminates the symptoms, manifestations of 
15 an inherited or acquired disease or that cures the disease. Also included 
are biologically active nucleic acid molecules, such as RNAi and 
antisense. 

As used herein, recitation that a polypeptide consists essentially of 
the protease domain means that the only SP portion of the polypeptide is 
20 a protease domain or a catalytically active portion thereof. The 

polypeptide can optionally, and generally will, include additional non-SP- 
derived sequences of amino acids. 

As used herein, cancer or tumor treatment or agent refers to any 
therapeutic regimen and/or compound that, when used alone or in 
25 combination with other treatments or compounds, can alleviate, reduce, 
ameliorate, prevent, or place or maintain in a state of remission of clinical 
symptoms or diagnostic markers associated with deficient angiogenesis. 

As used herein, domain refers to a portion of a molecule, e.g., 
proteins or the encoding nucleic acids, that is structurally and/or 
30 functionally distinct from other portions of the molecule. 
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As used herein, nucleic acids include DNA, RNA and analogs 
thereof, including peptide nucleic acids (PNA) and mixtures thereof. 
Nucleic acids can be single or double-stranded. When referring to probes 
or primers, which are optionally labeled, such as with a detectable label, 
5 such as a fluorescent or radiolabel, single-stranded molecules are 

contemplated. Such molecules are typically of a length such that their 
target is statistically unique or of low copy number (typically less than 5, 
generally less than 3) for probing or priming a library. Generally a probe 
or primer contains at least 14, 16 or 30 contiguous nucleotides of 

10 sequence complementary to or identical to a gene of interest. Probes and 
primers can be 10, 20, 30, 50, 100 or more nucleic acids long. 

As used herein, a probe or primer based on a nucleotide sequence 
disclosed herein, includes at least 10, 14, typically at least 16 contiguous 
nucleotides of SEQ ID No. 5, and probes of at least 30, 50 or 100 

15 contiguous nucleotides of SEQ ID No. 5. The length of the probe or 

primer required for unique hybridization is a function of the complexity of 
the genome of interest. 

As used herein, nucleic acid encoding a fragment or portion of a SP 
refers to a nucleic acid encoding only the recited fragment or portion of a 

20 SP, and not the other contiguous portions of the SP. 

As used herein, operative linkage of heterologous nucleic to 
regulatory and effector sequences of nucleotides, such as promoters, 
enhancers, transcriptional and translational stop sites, and other signal 
sequences refers to the relationship between such nucleic acid, such as 

25 DNA, and such sequences of nucleotides. For example, operative linkage 
of heterologous DNA to a promoter refers to the physical relationship 
between the DNA and the promoter such that the transcription of such 
DNA is initiated from the promoter by an RNA polymerase that specifically 
recognizes, binds to and transcribes the DNA. Thus, operatively linked 

30 or operationally associated refers to the functional relationship of nucleic 
acid, such as DNA, with regulatory and effector sequences of 
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nucleotides, such as promoters, enhancers, transcriptional and 
translational stop sites, and other signal sequences. For example, 
operative linkage of DNA to a promoter refers to the physical and 
functional relationship between the DNA and the promoter such that the 
5 transcription of such DNA is initiated from the promoter by an RNA 

polymerase that specifically recognizes, binds to and transcribes the DNA. 
In order to optimize expression and/or in vitro transcription, it can be 
necessary to remove, add or alter 5' untranslated portions of the clones 
to eliminate extra, potentially inappropriate alternative translation initiation 

10 (i.e., start) codons or other sequences that can interfere with or reduce 
expression, either at the level of transcription or translation. 
Alternatively, consensus ribosome binding sites (see, e.g., Kozak J. Biol. 
Chem. 265:19867-19870 (1991)) can be inserted immediately 5' of the 
start codon and can enhance expression. The desirability of (or need for) 

15 such modification can be empirically determined. 

As used herein, a sequence complementary to at least a portion of 
an RNA, with reference to antisense oligonucleotides, means a sequence 
having sufficient complementarity to be able to hybridize with the RNA, 
generally under moderate or high stringency conditions, forming a stable 

20 duplex; in the case of double-stranded SP antisense nucleic acids, a single 
strand of the duplex DNA (or dsRNA) can thus be tested, or triplex 
formation can be assayed. The ability to hybridize depends on the degree 
of complementarily and the length of the antisense nucleic acid. 
Generally, the longer the hybridizing nucleic acid, the more base 

25 mismatches with a SP encoding RNA it can contain and still form a stable 
duplex (or triplex, as the case can be). One skilled in the art can 
ascertain a tolerable degree of mismatch by use of standard procedures to 
determine the melting point of the hybridized complex. 

For purposes herein, amino acid substitutions, deletions and/or 

30 insertions, can be made in any of CVSPs and protease domains thereof 
provided that the resulting protein exhibits protease activity or other 
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activity (or, if desired, such changes can be made to eliminate activity). 
Muteins can be made by making conservative amino acid substitutions 
and also non-conservative amino acid substitutions. For example, amino 
acid substitutions that desirably or advantageously alter properties of the 
5 proteins can be made. In one embodiment, mutations that prevent 

degradation of the polypeptide can be made. Many proteases cleave after 
basic residues, such as R and K; to eliminate such cleavage, the basic 
residue is replaced with a non-basic residue. Interaction of the protease 
with an inhibitor can be blocked while retaining catalytic activity by 
10 effecting a non-conservative change at the site of interaction of the 
inhibitor with the protease. Other activities also can be altered. For 
example, receptor binding can be altered without altering catalytic 
activity. 

Amino acid substitutions contemplated include conservative 

15 substitutions, such as those set forth in Table 1, which do not eliminate 
proteolytic activity. As described herein, substitutions that alter 
properties of the proteins, such as removal of cleavage sites and other 
such sites are also contemplated; such substitutions are generally non- 
conservative, but can be readily effected by those of skill in the art. 

20 Suitable conservative substitutions of amino acids are known to 

those of skill in this art and can be made generally without altering the 
biological activity, for example enzymatic activity, of the resulting 
molecule. Those of skill in this art recognize that, in general, single amino 
acid substitutions in non-essential regions of a polypeptide do not 

25 substantially alter biological activity (see, e.g., Watson eta/. Molecular 
Biology of the Gene, 4th Edition, 1987, The Benjamin/Cummings Pub. 
co., p. 224). Also included within the definition, is the catalytically active 
fragment of a SP, particularly a single chain protease portion. 
Conservative amino acid substitutions are made, for example, in 

30 accordance with those set forth in TABLE 1 as follows: 
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TABLE 1 



Original residue 


Conservative substitution 


Ala (A) 


Gly; Ser, Abu 


Arg (R) 


Lys, orn 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His (H) 


Asn; Gin 


Me (I) 


Leu; Val; Met; Nle; Nva 


Leu (L) 


lie; Val; Met; Nle; Nv 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; Me; NLe Val 


Ornithine 


Lys; Arg 


Phe (F) 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp (W) 


Tyr 


Tyr (Y) 


Trp; Phe 


Val (V) 


Me; Leu; Met; Nle; Nv 



Other substitutions are also permissible and can be determined empirically 
or in accord with known conservative substitutions. 

As used herein, Abu is 2-aminobutyric acid; Orn is ornithine. 

25 As used herein, the amino acids, which occur in the various amino 

acid sequences appearing herein, are identified according to their well- 
known, three-letter or one-letter abbreviations. The nucleotides, which 
occur in the various DNA fragments, are designated with the standard 
single-letter designations used routinely in the art. 

30 As used herein, amelioration of the symptoms of a particular 

disorder by administration of a particular pharmaceutical composition 
refers to any lessening, whether permanent or temporary, lasting or 
transient that can be attributed to or associated with administration of the 
composition. 

35 As used herein, antisense polynucleotides refer to synthetic 

sequences of nucleotide bases complementary to mRNA or the sense 
strand of double-stranded DNA. Admixture of sense and antisense 
polynucleotides under appropriate conditions leads to the binding of the 
two molecules, or hybridization. When these polynucleotides bind to 
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(hybridize with) mRNA, inhibition of protein synthesis (translation) occurs. 
When these polynucleotides bind to double-stranded DNA, inhibition of 
RNA synthesis (transcription) occurs. The resulting inhibition of 
translation and/or transcription leads to an inhibition of the synthesis of 
5 the protein encoded by the sense strand. Antisense nucleic acid 
molecules typically contain a sufficient number of nucleotides to 
specifically bind to a target nucleic acid, generally at least 5 contiguous 
nucleotides, often at least 14 or 16 or 30 contiguous nucleotides or 
modified nucleotides complementary to the coding portion of a nucleic 

10 acid molecule that encodes a gene of interest, for example, nucleic acid 
encoding a single chain protease domain of a SP. 

As used herein, an array refers to a collection of elements, such as 
antibodies, containing two, three or more members. An addressable array 
is one in which the members of the array are identifiable, typically by 

15 position on a solid phase support. Hence, in general the members of the 
array are immobilized on discrete identifiable loci on the surface of a solid 
phase. 

As used herein, antibody refers to an immunoglobulin, whether 
natural or partially or wholly synthetically produced, including any 

20 derivative thereof that retains the specific binding ability of the antibody. 
Hence antibody includes any protein having a binding domain that is 
homologous or substantially homologous to an immunoglobulin binding 
domain. Antibodies include members of any immunoglobulin class, 
including IgG, IgM, IgA, IgD and IgE. 

25 As used herein, antibody fragment refers to any derivative of an 

antibody that is less then full length, retaining at least a portion of the 
full-length antibody's specific binding ability. Examples of antibody 
fragments include, but are not limited to, Fab, Fab', F(ab) 2 , single-chain 
Fvs (scFV), FV, dsFV diabody and Fd fragments. The fragment can 

30 include multiple chains linked together, such as by disulfide bridges. An 
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antibody fragment generally contains at least about 50 amino acids and 
typically at least 200 amino acids. 

As used herein, a Fv antibody fragment is composed of one 
variable heavy chain domain (V H ) and one variable light chain domain 
5 linked by noncovalent interactions. 

As used herein, a dsFV refers to an Fv with an engineered 
intermolecular disulfide bond, which stabilizes the V H -V L pair. 

As used herein, a F(ab) 2 fragment is an antibody fragment that 
results from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it 

10 can be recombinantly produced to produce the equivalent fragment. 

As used herein, Fab fragments are antibody fragments that result 
from digestion of an immunoglobulin with papain; it can be recombinantly 
produced to produce the equivalent fragment. 

As used herein, scFVs refer to antibody fragments that contain a 

15 variable light chain (V L ) and variable heavy chain (V H ) covalently 

connected by a polypeptide linker in any order. The linker is of a length 
such that the two variable domains are bridged without substantial 
interference. Included linkers are (Gly-Ser) n residues with some Glu or Lys 
residues dispersed throughout to increase solubility. 

20 As used herein, humanized antibodies refer to antibodies that are 

modified to include human sequences of amino acids so that 
administration to a human does not provoke an immune response. 
Methods for preparation of such antibodies are known. For example, to 
produce such antibodies, the encoding nucleic acid in the hybridoma or 

25 other prokaryotic or eukaryotic cell, such as an E. coli or a CHO cell, that 
expresses the monoclonal antibody is altered by recombinant nucleic acid 
techniques to express an antibody in which the amino acid composition of 
the non-variable region is based on human antibodies. Computer 
programs have been designed to identify such non-variable regions. 

30 As used herein, diabodies are dimeric scFV; diabodies typically 

have shorter peptide linkers than scFvs, and they generally dimerize. 
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As used herein, production by recombinant means by using 
recombinant DNA methods means the use of the well known methods of 
molecular biology for expressing proteins encoded by cloned DNA. 
As used herein the term assessing is intended to include 
5 quantitative and qualitative determination in the sense of obtaining an 

absolute value for the activity of a SP, or a domain thereof, present in the 
sample, and also of obtaining an index, ratio, percentage, visual or other 
value indicative of the level of the activity. Assessment can be direct or 
indirect and the chemical species actually detected need not of course be 

10 the proteolysis product itself but can for example be a derivative thereof 
or some further substance. 

As used herein, biological activity refers to the in vivo activities of 
a compound or physiological responses that result upon in vivo 
administration of a compound, composition or other mixture. Biological 

15 activity, thus, encompasses therapeutic effects and pharmaceutical 
activity of such compounds, compositions and mixtures. Biological 
activities can be observed in in vitro systems designed to test or use such 
activities. Thus, for purposes herein a biological activity of a protease its 
catalytic activity in which a polypeptide is hydrolyzed. 

20 As used herein, functional activity refers to an activity or activities 

of a polypeptide or portion thereof associated with a full-length (complete) 
protein. Functional activities include, but are not limited to, biological 
activity, catalytic or enzymatic activity, antigenicity (ability to bind to or 
compete with a polypeptide for binding to an anti-polypeptide antibody), 

25 immunogenicity, ability to form multimers, and the ability to specifically 
bind to a receptor or ligand for the polypeptide. 

As used herein, a conjugate refers to the compounds provided 
herein that include one or more SPs, including a CVSP16, particularly 
single chain protease domains thereof, and one or more targeting agents. 

30 These conjugates include those produced by recombinant means as fusion 
proteins, those produced by chemical means, such as by chemical 



-43- 



24745-1625 



coupling, through, for example, coupling to sulfhydryl groups, and those 
produced by any other method whereby at least one SP, or a domain 
thereof, is linked, directly or indirectly via linker(s) to a targeting agent. 

As used herein, a targeting agent, is any moiety, such as a protein 
5 or effective portion thereof, that provides specific binding of the 
conjugate to a cell surface receptor, which in some instances can 
internalize bound conjugates or portions thereof. A targeting agent also 
can be one that promotes or facilitates, for example, affinity isolation or 
purification of the conjugate; attachment of the conjugate to a surface; or 

10 detection of the conjugate or complexes containing the conjugate. 

As used herein, an antibody conjugate refers to a conjugate in 
which the targeting agent is an antibody. 

As used herein, derivative or analog of a molecule refers to a 
portion derived from or a modified version of the molecule. 

15 As used herein, an effective amount of a compound for treating a 

particular disease is an amount that is sufficient to ameliorate, or in some 
manner reduce the symptoms associated with the disease. Such an 
amount can be administered as a single dosage or can be administered 
according to a regimen, whereby it is effective. The amount can cure the 

20 disease but, typically, is administered in order to ameliorate the symptoms 
of the disease. Repeated administration can be required to achieve the 
desired amelioration of symptoms. 

As used herein equivalent, when referring to two sequences of 
nucleic acids, means that the two sequences in question encode the same 

25 sequence of amino acids or equivalent proteins. When equivalent is used 
in referring to two proteins or peptides, it means that the two proteins or 
peptides have substantially the same amino acid sequence with only 
amino acid substitutions (such as, but not limited to, conservative 
changes such as those set forth in Table 1 , above) that do not 

30 substantially alter the activity or function of the protein or peptide. When 
equivalent refers to a property, the property does not need to be present 



-44- 



to the same extent (e.g., two peptides can exhibit different rates of the 
same type of enzymatic activity), but the activities are usually 
substantially the same. Complementary, when referring to two nucleotide 
sequences, means that the two sequences of nucleotides are capable of 
5 hybridizing, typically with less than 25%, 15% or 5% mismatches 
between opposed nucleotides. If necessary, the percentage of 
complementarity will be specified. Typically the two molecules are 
selected such that they will hybridize under conditions of high stringency. 
As used herein, an agent that modulates the activity of a protein or 

10 expression of a gene or nucleic acid either decreases or increases or 
otherwise alters the activity of the protein or, in some manner, up- or 
down-regulates or otherwise alters expression of the nucleic acid in a cell. 

As used herein, inhibitor of the activity of a SP encompasses any 
substances that prohibit or decrease production, post-translational 

15 modification(s), maturation, or membrane localization of the SP or any 
substance that interferes with or decreases the proteolytic efficacy 
thereof, particularly of a single chain form in an in vitro screening assay. 

As used herein, a method for treating or preventing neoplastic 
disease means that any of the symptoms, such as the tumor, metastasis 

20 thereof, the vascularization of the tumors or other parameters by which 
the disease is characterized are reduced, ameliorated, prevented, placed 
in a state of remission, or maintained in a state of remission. It also 
means that the hallmarks of neoplastic disease and metastasis can be 
eliminated, reduced or prevented by the treatment. Non-limiting examples 

25 of the hallmarks include uncontrolled degradation of the basement 
membrane and proximal extracellular matrix, migration, division, and 
organization of the endothelial cells into new functioning capillaries, and 
the persistence of such functioning capillaries. 

As used herein, pharmaceutical^ acceptable salts, esters or other 

30 derivatives of the conjugates include any salts, esters or derivatives that 
can be readily prepared by those of skill in this art using known methods 
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for such derivatization and that produce compounds that can be 
administered to animals or humans without substantial toxic effects and 
that either are pharmaceutical^ active or are prodrugs. 

As used herein, a prodrug is a compound that, upon in vivo 
5 administration, is metabolized or otherwise converted to the biologically, 
pharmaceutical^ or therapeutically active form of the compound. To 
produce a prodrug, the pharmaceutical^ active compound is modified 
such that the active compound is regenerated by metabolic processes. 
The prodrug can be designed to alter the metabolic stability or the 

10 transport characteristics of a drug, to mask side effects or toxicity, to 

improve the flavor of a drug or to alter other characteristics or properties 
of a drug. By virtue of knowledge of pharmacodynamic processes and 
drug metabolism in vivo, those of skill in this art, once a pharmaceutical^ 
active compound is known, can design prodrugs of the compound (see, 

15 e.g., Nogrady (1985) Medicinal Chemistry A Biochemical Approach, 
Oxford University Press, New York, pages 388-392). 

As used herein, a drug identified by the screening methods 
provided herein refers to any compound that is a candidate for use as a 
therapeutic or as a lead compound for the design of a therapeutic. Such 

20 compounds can be small molecules, including small organic molecules, 

peptides, peptide mimetics, antisense molecules or dsRNA, such as RNAi, 
antibodies, fragments of antibodies, recombinant antibodies and other 
such compound which can serve as drug candidate or lead compound. 
As used herein, a peptidomimetic is a compound that mimics the 

25 conformation and certain stereochemical features of the biologically active 
form of a particular peptide. In general, peptidomimetics are designed to 
mimic certain desirable properties of a compound, but not the undesirable 
properties, such as flexibility, that lead to a loss of a biologically active 
conformation and bond breakdown. Peptidomimetics can be prepared 

30 from biologically active compounds by replacing certain groups or bonds 
that contribute to the undesirable properties with bioisosteres. 
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Bioisosteres are known to those of skill in the art. For example the 
methylene bioisostere CH 2 S has been used as an amide replacement in 
enkephalin analogs (see, e.g., Spatola (1983) pp. 267-357 in Chemistry 
and Biochemistry o f Amino Acids, Peptides, and Proteins, Weinstein, Ed. 
5 volume 7, Marcel Dekker, New York). Morphine, which can be 

administered orally, is a compound that is a peptidomimetic of the peptide 
endorphin. For purposes herein, cyclic peptides are included among 
peptidomimetics. 

As used herein, a promoter region or promoter element refers to a 

10 segment of DNA or RNA that controls transcription of the DNA or RNA to 
which it is operatively linked. The promoter region includes specific 
sequences that are sufficient for RNA polymerase recognition, binding and 
transcription initiation. This portion of the promoter region is referred to 
as the promoter. In addition, the promoter region includes sequences that 

15 modulate this recognition, binding and transcription initiation activity of 
RNA polymerase. These sequences can be cis acting or can be 
responsive to trans acting factors. Promoters, depending upon the nature 
of the regulation, can be constitutive or regulated. Exemplary promoters 
contemplated for use in prokaryotes include the bacteriophage T7 and T3 

20 promoters. 

As used herein, a receptor refers to a molecule that has an affinity 
for a given ligand. Receptors can be naturally-occurring or synthetic 
molecules. Receptors also can be referred to in the art as anti-ligands. 
As used herein, the receptor and anti-ligand are interchangeable. 

25 Receptors can be used in their unaltered state or bound to other 

polypeptides, including as homodimers. Receptors can be attached to, 
covalently or noncovalently, or in physical contact with, a binding 
member, either directly or indirectly via a specific binding substance or 
linker. Examples of receptors, include, but are not limited to: antibodies, 

30 cell membrane receptors surface receptors and internalizing receptors, 
monoclonal antibodies and antisera reactive with specific antigenic 
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determinants [such as on viruses, cells, or other materials], drugs, 
polynucleotides, nucleic acids, peptides, cofactors, lectins, sugars, 
polysaccharides, cells, cellular membranes, and organelles. 

Examples of receptors and applications using such receptors, 
5 include but are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to 
survival of microorganisms, which could serve as targets for antibiotic 
[ligand] selection; 

b) antibodies: identification of a ligand-binding site on the antibody 
10 molecule that combines with the epitope of an antigen of interest can be 

investigated; determination of a sequence that mimics an antigenic 
epitope can lead to the development of vaccines of which the immunogen 
is based on one or more of such sequences or lead to the development of 
related diagnostic agents or compounds useful in therapeutic treatments 
15 such as for auto-immune diseases 

c) nucleic acids: identification of ligand, such as protein or RNA, 
binding sites; 

d) catalytic polypeptides: polymers, including polypeptides, that 
are capable of promoting a chemical reaction involving the conversion of 

20 one or more reactants to one or more products; such polypeptides 
generally include a binding site specific for at least one reactant or 
reaction intermediate and an active functionality proximate to the binding 
site, in which the functionality is capable of chemically modifying the 
bound reactant (see, e.g., U.S. Patent No. 5,215,899); 

25 e) hormone receptors: determination of the ligands that bind with 

high affinity to a receptor is useful in the development of hormone 
replacement therapies; for example, identification of ligands that bind to 
such receptors can lead to the development of drugs to control blood 
pressure; and 
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f) opiate receptors: determination of ligands that bind to the opiate 
receptors in the brain is useful in the development of less-addictive 
replacements for morphine and related drugs. 

As used herein, sample refers to anything which can contain an 
5 analyte for which an analyte assay is desired. The sample can be a 
biological sample, such as a biological fluid or a biological tissue. 
Examples of biological fluids include urine, blood, plasma, serum, saliva, 
semen, stool, sputum, cerebral spinal fluid, tears, mucus, amniotic fluid or 
the like. Biological tissues are aggregates of cells, usually of a particular 

10 kind together with their intercellular substance that form one of the 
structural materials of a human, animal, plant, bacterial, fungal or viral 
structure, including connective, epithelium, muscle and nerve tissues. 
Examples of biological tissues also include organs, tumors, lymph nodes, 
arteries and individual cell(s). 

15 As used herein: stringency of hybridization in determining 

percentage mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C 

20 Those of skill in this art know that the washing step selects for 

stable hybrids and also know the ingredients of SSPE (see, e.g., 
Sambrook, E.F. Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press (1989), vol. 3, p. B.13, see, 
also, numerous catalogs that describe commonly used laboratory 

25 solutions). SSPE is pH 7.4 phosphate-buffered 0.18 M NaCI. Further, 

those of skill in the art recognize that the stability of hybrids is determined 
by T m , which is a function of the sodium ion concentration and 
temperature (T m = 81 .5° C-1 6.6(log 10 [Na + ]) + 0.41 (%G + O-600/D), so 
that the only parameters in the wash conditions critical to hybrid stability 

30 are sodium ion concentration in the SSPE (or SSC) and temperature. 
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It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows 
(see also Shilo and Weinberg, Proc. Natl. Acad. Sci. USA 75:6789-6792 
5 (1981)): Filters containing DNA are pretreated for 6 hours at 40°C in a 
solution containing 35% formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 
5 mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 //g/ml denatured 
salmon sperm DNA (10X SSC is 1 .5 M sodium chloride, and 0.15 M 
sodium citrate, adjusted to a pH of 7). 

10 Hybridizations are carried out in the same solution with the 

following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 
//g/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 
cpm 32 P-labeled probe is used. Filters are incubated in hybridization 
mixture for 18-20 hours at 40°C, and then washed for 1.5 hours at 55°C 

15 in a solution containing 2X SSC, 25 mM Tris-HCI (pH 7.4), 5 mM EDTA, 
and 0.1% SDS. The wash solution is replaced with fresh solution and 
incubated an additional 1 .5 hours at 60°C. Filters are blotted dry and 
exposed for autoradiography. If necessary, filters are washed for a third 
time at 65-68°C and reexposed to film. Other conditions of low 

20 stringency which can be used are well known in the art (e.g., as 
employed for cross-species hybridizations). 

By way of example and not way of limitation, procedures using 
conditions of moderate stringency include, for example, but are not 
limited to, procedures using such conditions of moderate stringency are 

25 as follows: Filters containing DNA are pretreated for 6 hours at 55°C in a 
solution containing 6X SSC, 5X Denhart's solution, 0.5% SDS and 100 
//g/ml denatured salmon sperm DNA. Hybridizations are carried out in the 
same solution and 5-20 X 10 6 cpm 32 P-labeled probe is used. Filters are 
incubated in hybridization mixture for 18-20 hours at 55°C, and then 

30 washed twice for 30 minutes at 60°C in a solution containing IX SSC 
and 0.1% SDS. Filters are blotted dry and exposed for autoradiography. 
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Other conditions of moderate stringency which can be used are well- 
known in the art. Washing of filters is done at 37°C for 1 hour in a 
solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using 
5 conditions of high stringency are as follows: Prehybridization of filters 
containing DNA is carried out for 8 hours to overnight at 65 °C in buffer 
composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 jug/ml denatured salmon sperm 
DNA. Filters are hybridized for 48 hours at 65 °C in prehybridization 

10 mixture containing 100 jjg/m\ denatured salmon sperm DNA and 5-20 X 
10 6 cpm of 32 P-labeled probe. Washing of filters is done at 37°C for 
1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 
0.01 % BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 
minutes before autoradiography. Other conditions of high stringency 

15 which can be used are well known in the art. 

The term substantially identical or homologous or similar varies 
with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 
more preferably at least 90%, and most preferably at least 95% identity. 

20 As used herein, substantially identical to a product means 

sufficiently similar so that the property of interest is sufficiently 
unchanged so that the substantially identical product can be used in place 
of the product. 

As used herein, substantially pure means sufficiently homogeneous 
25 to appear free of readily detectable impurities as determined by standard 
methods of analysis, such as thin layer chromatography (TLC), gel 
electrophoresis and high performance liquid chromatography (HPLC), used 
by those of skill in the art to assess such purity, or sufficiently pure such 
that further purification would not detectably alter the physical and 
30 chemical properties, such as enzymatic and biological activities, of the 
substance. Methods for purification of the compounds to produce 
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substantially chemically pure compounds are known to those of skill in 
the art. A substantially chemically pure compound can, however, be a 
mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 
5 As used herein, target cell refers to a cell that expresses a SP in 

vivo. 

As used herein, test substance (or test compound) refers to a 
chemically defined compound (e.g. , organic molecules, inorganic 
molecules, organic/inorganic molecules, proteins, peptides, nucleic acids, 

10 oligonucleotides, lipids, polysaccharides, saccharides, or hybrids among 
these molecules such as glycoproteins, and others) or mixtures of 
compounds (e.g., a library of test compounds, natural extracts or culture 
supernatants and others) whose effect on a SP, particularly a single chain 
form that includes the protease domain or a sufficient portion thereof for 

15 activity, as determined by an in vitro method, such as the assays 
provided herein, is tested. 

As used herein, a molecule, such as an antibody, that specifically 
binds to a polypeptide typically has a binding affinity (K a ) of at least about 
10 6 l/mol, 10 7 l/mol, 10 8 l/mol, 10 9 l/mol, 10 10 l/mol or greater and binds 

20 to a protein of interest generally with at least 2-fold, 5-fold, generally 10- 
fold or even 100-fold or greater, affinity than to other proteins. For 
example, an antibody that specifically binds to the protease domain 
compared to the full-length molecule, such as the zymogen form, binds 
with at least about 2-fold, typically 5-fold or 10-fold higher affinity, to a 

25 polypeptide that contains only the protease domain than to the zymogen 
form of the full-length. Such specific binding also is referred to as 
selective binding. Thus, specific or selective binding refers to greater 
binding affinity (generally at least 2-fold, 5-fold, 10-fold or more) to a 
targeted site or locus compared to a non-targeted site or locus. 

30 As used herein, the terms a therapeutic agent, therapeutic regimen, 

radioprotectant, or chemotherapeutic mean conventional drugs and drug 
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therapies, including vaccines, which are known to those skilled in the art. 
Radiotherapeutic agents are well known in the art. 

As used herein, treatment means any manner in which the symp- 
toms of a condition, disorder or disease are ameliorated or otherwise 
5 beneficially altered. Treatment also encompasses any pharmaceutical use 
of the compositions herein. 

As used herein, vector (or plasmid) refers to discrete elements that 
are used to introduce heterologous nucleic acid into cells for either 
expression or replication thereof. The vectors typically remain episomal, 

10 but can be designed to effect integration of a gene or portion thereof into 
■ a chromosome of the genome. Also contemplated are vectors that are 
artificial chromosomes, such as yeast artificial chromosomes and 
mammalian artificial chromosomes. Selection and use of such vehicles are 
well known to those of skill in the art. An expression vector includes 

15 vectors capable of expressing DNA that is operatively linked with 
regulatory sequences, such as promoter regions, that are capable of 
effecting expression of such DNA fragments. Thus, an expression vector 
refers to a recombinant DNA or RNA construct, such as a plasmid, a 
phage, recombinant virus or other vector that, upon introduction into an 

20 appropriate host cell, results in expression of the cloned DNA. 

Appropriate expression vectors are well known to those of skill in the art 
and include those that are replicable in eukaryotic cells and/or prokaryotic 
cells and those that remain episomal or those which integrate into the 
host cell genome. 

25 As used herein, protein binding sequence refers to a protein or 

peptide sequence that is capable of specific binding to other protein or 
peptide sequences generally, to a set of protein or peptide sequences or 
to a particular protein or peptide sequence. 

As used herein, epitope tag refers to a short stretch of amino acid 

30 residues corresponding to an epitope to facilitate subsequent biochemical 
and immunological analysis of the epitope tagged protein or peptide. 
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Epitope tagging is achieved by adding the sequence of the epitope tag to 

a protein-encoding sequence in an appropriate expression vector. Epitope 

tagged proteins can be affinity purified using highly specific antibodies 

raised against the tags. 
5 As used herein, metal binding sequence refers to a protein or 

peptide sequence that is capable of specific binding to metal ions 

generally, to a set of metal ions or to a particular metal ion. 

As used herein, a combination refers to any association between 

two or among more items. 
10 As used herein, a composition refers to any mixture. It can be a 

solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or 

any combination thereof. 

As used herein, fluid refers to any composition that can flow. 

Fluids thus encompass compositions that are in the form of semi-solids, 
15 pastes, solutions, aqueous mixtures, gels, lotions, creams and other such 

compositions. 

As used herein, a kit is a packaged combination optionally including 
instructions for use of the combination and/or other reactions and 
components for such use. 

20 As used herein, a cellular extract refers to a preparation or fraction 

which is made from a lysed or disrupted cell. 

As used herein, an agent is said to be randomly selected when the 
agent is chosen randomly without considering the specific sequences 
involved in the association of a protein alone or with its associated 

25 substrates, binding partners and/or other components. An example of 
randomly selected agents is the use a chemical library or a peptide 
combinatorial library, or a growth broth of an organism or conditioned 
medium. 

As used herein, an agent is said to be rationally selected or 
30 designed when the agent is chosen on a non-random basis which takes 
into account the sequence of the target site and/or its conformation in 
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connection with the agent's action. As described in the Examples, there 
are proposed binding sites for serine protease and (catalytic) sites in the 
protein having SEQ ID No. 6. Agents can be rationally selected or 
rationally designed by using the peptide sequences that make up these 
5 sites. 

For clarity of disclosure, and not by way of limitation, the detailed 
description is divided into the subsections that follow. 

B. CVSP16 POLYPEPTIDES, MUTEINS, DERIVATIVES AND ANALOGS 
THEREOF 

10 SPs 

The serine proteases (SPs) are a family of proteins found in 
mammals and also other species. SPs share a number of common 
structural features as described herein. The proteolytic domains share 
sequence homology generally including conserved His, Asp, and Ser 
15 residues necessary for efficient catalytic activity that are present in 

conserved motifs. These SPs generally are synthesized as zymogens, and 
activated to multi-chain forms (i.e., two-chains, three-chains) by specific 
cleavage. 

The SP family can be targeted for therapeutic intervention and also 
20 can serve as diagnostic markers for tumor initiation, development, growth 
and/or progression. As discussed, members of this family are involved in 
proteolytic processes that are implicated in tumor development, growth 
and/or progression. This implication is based upon their functions as 
proteolytic enzymes in extracellular matrix degradation and remodelling 
25 and growth- and pro-angiogenic factor activation. In addition, their levels 
of expression or level of activation or their apparent activity resulting from 
substrate and/or co-factor levels or alterations in substrates and/or co- 
factors and levels thereof differ in tumor cells from non-tumor cells in the 
same tissue. Hence, protocols and treatments that alter their activity, 
30 such as their proteolytic activities and roles in signal transduction, and/or 
their expression, such as by contacting them with a compound that 
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modulates their activity and/or expression, could impact tumor 
development, growth and/or progression. Also, in some instances, the 
level of activation and/or expression can be altered in tumors, such as 
pancreas, stomach, uterus, lung, colon and cervical cancers, and also 
5 breast, prostate or leukemias. The SP, thus, can serve as a diagnostic 
marker for tumors. 

In other instances the SP protein can exhibit altered activity by 
virtue of a change in activity or expression of a co-factor therefor or a 
substrate therefor. Detection of the SPs, particularly the protease 

10 domains, in body fluids, such as serum, blood, saliva, cerebral spinal 
fluid, synovial fluid and interstitial fluids, urine, sweat and other such 
fluids and secretions, can serve as a diagnostic tumor marker. In 
particular, detection of higher levels of such polypeptides in a subject 
compared to a subject known not to have any neoplastic disease or 

15 compared to earlier samples from the same subject, can be indicative of 
neoplastic disease in the subject. 
CVSP16 

Provided are polypeptides family members designated CVSP16. 
The CVSP1 6s provided herein are serine proteases that are expressed 

20 and/or activated in certain tumors; hence their activation or expression 
can serve as a diagnostic marker for tumor development, growth and/or 
progression. The CVSP16 polypeptides provided herein and can be used 
as a drug or therapeutic target and used in screening assays, including 
those exemplified herein. Dimerized and higher multimers of CVSP16 

25 polypeptides and/or portions thereof are provided. 

A single chain proteolytic domain can function in vitro and, hence 
is useful in in vitro assays for identifying agents that modulate the activity 
of members of this family. In addition two-chain or three chain forms, the 
activated full-length or truncated forms thereof, such as forms in which 

30 the signal peptide is removed, also can be used in such assays. Assays 
for activation also are provided. 
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The CVSP16 polypeptides provided herein are expressed or 
activated or active in tumor cells, typically at a level that differs from the 
level in which they are expressed or activated or active in the non-tumor 
cell of the same type. Hence, for example, if the CVSPT6 is expressed in 
5 a prostate or ovarian tumor cell, to be of interest herein with respect to 
ovarian or prostate cancer, it is expressed or activated or active at a 
different level in non-tumor prostate or ovarian cells. CVSP16 
polypeptide is expressed in lung, colon, prostate, breast, uterine, ovarian 
and other tumor cells. 

10 As discussed, the members of this family are involved in proteolytic 

processes that are implicated in tumor development, growth and/or 
progression. This implication is based upon their functions as proteolytic 
enzymes in processes related to ECM degradative pathways. In addition, 
their levels of expression or level of activation or their apparent activity 

15 resulting from substrate levels or alterations in substrates and levels 
thereof differs in tumor cells and non-tumor cells in the same tissue. 
Also, in some instances, the level of activation and/or expression can be 
altered in tumors, such as pancreas, stomach, uterus, lung, colon and 
cervical cancers, and also breast, prostate or leukemias. The SP, thus, 

20 can serve as a diagnostic marker for tumors. 

It is shown herein, that CVSP16s provided herein are expressed 
and/or activated in certain tumors; hence their activation or expression 
can serve as a diagnostic marker for tumor development, growth and/or 
progression. The CVSP16 also is intended for use as a drug target and 

25 used in screening assays, including those CSVP16s exemplified herein. 
The CVSP16 polypeptides provided herein are expressed or activated by 
or in tumor cells, typically at a level that differs from the level in which 
they are expressed by the non-tumor cell of the same type. Hence, for 
example, if the SP expressed by a prostate or ovarian tumor cell is to be 

30 of interest herein with respect to ovarian or prostate cancer, it should 
have an expression, extent of activation or activity that is different from 
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that in non-tumor cells. CVSP16 is expressed in lung, colon, prostate, 
breast, uterine, ovarian and other tumor cells. 

In certain embodiments, a CVSP1 6 polypeptide or a portion thereof 
is detectable in a body fluid at a level that differs from its level in body 
5 fluids in a subject not having a tumor. In other embodiments, the 
polypeptide is present in a tumor; and a substrate or cofactor for the 
polypeptide is expressed at levels that differ from its level of expression in 
a non-tumor cell in the same type of tissue. 

Detection of the SP, or a protease domain thereof, in body fluids, 

10 such as serum, blood, saliva, cerebral spinal fluid, synovial fluid and 

interstitial fluids, urine, sweat and other such fluids and secretions, can 
serve as a diagnostic tumor marker. In particular, detection of higher 
levels or alternative forms of such polypeptides in a subject compared to 
a subject known not to have any neoplastic disease or compared to earlier 

15 samples from the same subject, can be indicative of neoplastic disease in 
the subject. 

Isolated, substantially pure proteases that contain protease 
domains or catalytically active (or functionally active) portions thereof in a 
single chain form of SPs are provided. A protease domain or plurality 

20 thereof can be included in a longer protein, and such longer protein is 
optionally the CVSP1 6 zymogen. In particular, exemplary protease 
domains include at least a sufficient portion of sequences of amino acids 
set forth in SEQ ID No. 6 (encoded by nucleotides in SEQ ID No. 5) to 
exhibit protease activity in an assay provided herein. 

25 The domains, fragments, derivatives or analogs of a CVSP16 that 

are functionally active are capable of exhibiting one or more functional 
activities associated with the CVSP16 polypeptide, such as serine 
protease activity, immunogenicity and antigenicity, are provided. 
CVSP16 contains a signal peptide sequence (aa 1 to aa 23) and two 

30 protease domains. Also included are substantially purified CVSP16 
zymogen, activated multi-chain forms of the polypeptide, single chain 
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protease domains, two-chain protease domains. These are encoded by a 
nucleic acid that includes sequence encoding at least one protease 
domain that exhibits proteolytic activity and that hybridizes to a nucleic 
acid molecule including the sequence of nucleotides set forth in SEQ ID 
5 No. 5, typically under moderate, generally under high stringency, 

conditions and generally along the full length (or at least about 70, 80 or 
90%) of a protease domain. Polypeptides encoded by splice variants are 
also provided as long as the polypeptides do not include at least 5, 7, 10, 
15, 20 or more contiguous amino acids set forth in SEQ ID No. 21. 
10 CVSP16 polypeptides 

As described CVSP16 polypeptides provided herein include 
polypeptides: 

a) a polypeptide encoded by the sequence of nucleotides set 
forth in SEQ ID No. 5; 

15 b) a polypeptide encoded by a sequence of nucleotides that 

hybridizes under conditions of low, moderate or high stringency to the 
sequence of nucleotides set forth in SEQ ID No. 5, wherein the encoded 
polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 

20 from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6; 

c) a polypeptide that contains the sequence of amino acids 
set forth in SEQ ID No. 6, particularly, amino acids 24-752 and C-terminal 

25 truncated portions thereof; 

d) a polyeptide that contains at least two protease domains 
of a serine protease 16 (CVSP16) and includes at least 5 contiguous 
amino acids corresponding to residues 508-544 of SEQ ID No. 6. or 
contains the contiguous sequence Asn Asp Ser or Trp Asn Asp or Ser 

30 Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu Gin 
Thr His in the second protease domain; 
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e) a polypeptide that contains a sequence of amino acids 
having at least about 60%, 70%, 80%, 90% or about 95% sequence 
identity with the sequence of amino acids set forth in SEQ ID No. 6, 
wherein the polypeptide does not include at least 5 or at least 6 or at 

5 least 7 or at least 8 contiguous or at least 9 or at least 10 contiguous 
amino acids from SEQ ID No. 21 up to all of SEQ ID No. 21, and 
particularly does not include any amino acids therefrom between amino 
acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6; 

f) the polypeptide is encoded by a sequence of 

10 nucleotides that hybridizes under conditions of at least moderate, and can 
be high, stringency along at least 70% of its full length to a sequence of 
nucleotides than encodes a polypeptide of any of a)-e), wherein the 
polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 

15 from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6; 

g) the polypeptide has at least 60%, 70%, 80%, 90% or 
about 95% sequence identity with a polypeptide of any of a)-f), wherein 

20 the polypeptide does not include at least 5 or at least 6 or at least 7 or at 
least 8 contiguous or at least 9 or at least 10 contiguous amino acids 
from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly do not 
include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6; and/or 

25 h) a polypeptide encoded by a splice variant of a sequence of 

nucleotides that encodes a CVSP16 polypeptide provided herein. Smaller 
portions thereof that retain protease activity are contemplated. 

An exemplary CVSP16 polypeptide includes the sequence of amino 
acids set forth in SEQ ID No. 6; the protease domains are included in the 

30 sequence of amino acids set forth in SEQ ID No. 6. 
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Also, provided herein are isolated substantially pure polypeptides 
that contain a protease domain of a CVSP1 6. The polypeptide protein 
also can include other non-CVSP16 sequences of amino acids, but 
includes a protease domain or a sufficient portion thereof to exhibit 
5 catalytic activity (or other functional activity, such as substrate or ligand 
binding activity) in any in vitro assay that assess such protease activity, 
such as any provided herein. The CVSP16 polypeptides do not include 
the sequence of amino acids set forth in SEQ ID No. 21 and/or they do 
not include at least 5, 7, 9, 1 0, 1 1 , 1 5, 20 or more contiguous amino 

10 acids thereof between amino acids that correspond to Q660 and M661 of 
SEQ ID No. 6 and/or the CVSP1 6 polypeptide contains at least two 
protease domains of a serine protease 16 (CVSP16) and includes at least 
5 contiguous amino acids corresponding to residues 508-544 of SEQ ID 
No. 6. (SEQ ID No. 22) or contains the contiguous sequence Asn Asp Ser 

15 or Trp Asn Asp or Ser Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or 
Gin Thr His or Leu Gin Thr His in the second protease domain. Such 
polypeptides are zymogens or activated single- two- or three-chain 
molecules. 

Provided are substantially purified CVSP1 6 zymogens, activated 
20 two and three chain forms, single chain protease domains and two-chain 
protease domains. A full-length exemplary CVSP16 polypeptide, 
including the signal sequence, is set forth in SEQ ID No. 6 The signal 
sequence (see amino acids 1-23, SEQ ID No. 6) can be cleaved upon 
expression or the encoding nucleic acid can be deleted prior to 
25 expression. 

Also provided is a substantially purified protein including a 
sequence of amino acids that has at least 60%, 70%, 80%, 90% or 
about 95%, identity to the exemplified CVSP16 or to a protease domain 
thereof, and does not include at least 5, 7, 9, 1 1, 15, 20 or more 
30 contiguous amino acids of SEQ ID No. 21. Percentage identity is 

determined using standard algorithms and gap penalties that maximize the 
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percentage identity. A huma CVSP16 polypeptide is exemplified, 
although other mammalia CVSP16 polypeptides are contemplated and are 
obtained by standard methods using the disclosed CVSP1 6-encoding 
nucleic acid (or antibodies made to the CVSP16) to isolate corresponding 
5 nucleic acid molecules (and/or CVSP16s) from other species. 

Polypeptides peptides encoded by splice variants of the exemplified 
encoding nucleic acid (SEQ ID No. 5), particularly those with a 
proteolytically active protease domain, but not containing at least 5, 7, 
10, 15, 20 or more contiguous amino acids from SEQ ID No. 21, 

10 particularly inserted between the amino acids corresponding to Q660 and 
M661 (SEQ ID No. 6) are provided. 

Provided are substantially purified CVSP1 6 polypeptides and 
functional domains thereof, including catalytically active domains and 
portions, that have at least about 60%, 70%, 80%, 90% or about 95% 

15 sequence identity with a CVSP16 that includes the sequence of amino 
acids set forth in SEQ ID No. 6 or a catalytically active portion thereof. 
The CVSP16 polyeptides provided herein do not include at least 5 
contiguous amino acid residues as set forth in SEQ ID No. 21, particularly 
between residues corresponding to Q 660 and M 661 of SEQ ID No. 6. Also 

20 provided are polypeptides that are encoded by the nucleic acid molecules 
provided herein. Polypeptides that contain at least two protease domains 
of a serine protease 16 (CVSP16) and includes at least 5 contiguous 
amino acids corresponding to residues 508-544 of SEQ ID No. 6. or 
contains the contiguous sequence Asn Asp Ser or Trp Asn Asp or Ser 

25 Cys Trp Asn Asp Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu Gin 
Thr His in the second protease domain. 

In other embodiments, substantially purified polypeptides that 
include a protease domain of a CVSP16 polypeptide or a catalytically 
active portion thereof, but that do not include the entire sequence of 

30 amino acids set forth in SEQ ID No. 6 are provided. Among these are 
polypeptides that include a sequence of amino acids that has at least 
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60%, 70%, 80%, 90%, 95% or 100% sequence identity to SEQ ID 
No. 6. 

Included among the polypeptides provided herein is a CVSP16 
protease domain and/or a polypeptide with amino acid changes such that 
5 the specificity and protease activity (or other functional activity) is not 
eliminated and is retained at least 1%, 2%, 3%, 5%, 10%, 20%, 30%, 
40%, 50% activity or remains substantially unchanged (more than about 
50%) or increases. The CVSP16 polypeptide can form homodimers and 
also can form heterodimers with some other protein, such as a 

10 membrane-bound protein. 

Domains, fragments, derivatives or analogs of a CVSP1 6 that are 
functionally active are capable of exhibiting one or more functional 
activities and/or other activities or properties, such as immunogenicity 
and antigenicity, also are provided. 

1 5 Protease domains and antigenic fragments 

Isolated, substantially pure polypeptides that include the protease 
domains or catalytically active portions thereof as single chain forms of 
SPs are provided. The protease domains can be included in a longer 
protein, and such longer proteins include up to a full-length CVSP16 

20 zymogen as long as the full-length polypeptide does not include SEQ ID 
No. 21 . Provided herein are isolated substantially pure polypeptides that 
contain a protease domain of a CVSP16 as a single chain. 

SP protease domains include the single-chain protease domains of 
CVSP1 6. Provided are the protease domains or proteins that include a 

25 portion of a SP that is the protease domain of any SP, particularly a 

CVSP16. The protein also can include other non-SP sequences of amino 
acids, but includes the protease domain or a sufficient portion thereof to 
exhibit catalytic and/or binding activity in any in vitro assay that assesses 
such activity(ies), such as any provided herein. Also provided are two- 

30 and three-chain activated forms of the full length protease and also two 
chain forms of a protease domain. 
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In an embodiment, the substantially purified CVSP16 protease is 
encoded by a nucleic acid that hybridizes to a nucleic acid molecule 
containing the protease domain encoded by the nucleotide sequence set 
forth in SEQ. ID No. 5 under at least moderate, generally high, stringency 
5 conditions, such that the protease domain encoding nucleic acid thereof 
hybridizes along its full length or along at least about 70%, 80% or 90% 
of the full length. In other embodiments the substantially purified 
CVSP16 protease is a single chain polypeptide that includes substantially 
the sequence of amino acids set forth as amino acids 24-752 in SEQ ID 

10 No. 6, or a catalytically active (or functionally active or immunogenic or 
antigenic) portion thereof. 

Polypeptides that additionally include amino acids at the C- 
terminus, such as all or a portion of the amino acids following the 
protease domain (aa 551 to aa 752 in SEQ ID No. 6) in the exemplified 

15 embodiment are provided. Dimers and other multimers of the full length 
and catalytically active portions of the polypeptides that include PD1 
and/or PD2 are provided. 

A signal peptide (amino acids 1-23 of SEQ ID No. 6 in the 
exemplified embodiment) also is provided. In addition the mature 

20 CVSP16 polypeptide with the signal sequence removed and catalytically 
active portions thereof, including those that include all or a portion of the 
C-terminus beyond the protease domain are provided. 

As described below, all forms of the CVSP16, including the pro- 
polypeptide with the signal sequence, the mature polypeptide and 

25 catalytically active portions thereof, the protease domains and 

catalytically active portions thereof, three-chain, two-chain and single 
chain forms of any of these polypeptides are provided herein and can be 
used in the screening assays and for preparing specific antibodies 
therefor. The expression, quantity, activity and/or activation of the 

30 polypeptides in tumor cells and body fluids can be diagnostic of disease 
or its absence. 
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CVSP16 has two protease domains PD1 and PD2 (see, e.g., amino 
acids 46 to 286 an amino acids 323 to 550 of SEQ ID No. 6, respec- 
tively). Polypeptides that contain PD1 or PD2 as the only CVSP16 
portion thereof are provided or that contain PD1 and PD2 but do not 
5 include at least 5, 7, 10, 15, 20 or more contiguous amino acids of SEQ 
ID No. 21 are provided. 

With reference to SEQ ID No. 6 of the exemplified CVSP16, the 
exemplified CVSP16 has 8 putative /V-linked glycosylation sites 
( . . . Ng 2 GT. . . , . . . N i 3qYS . . . , . . . N 2 -j 7LT. . . , . . . N31 7 CT. . . , . . . N 369SS . . . , . . . N 4 q 2 AS. . . , 
10 ...N 421 LS..., ...N 508 DS...). The following cysteine pairings are noted: C 72 - 

^88' C 173 -C 2 49, ^206"*^228' C 239 -C 267 , ^348*^364' C 444 -C 516 , C 472 ~C 494 and C 50 g- 

C 534 . In addition, an unpaired cysteine (C 159 ) in the first protease domain 
should pair with C 38 . An unpaired cysteine (C 430 ) in the second protease 
domain should pair with C 325 or C 318 . As a result the protease, upon 

15 activation cleavage can contain multiple chains, including two or three or 
more chains, formed by virtue of pairing between the unpaired cysteine 
in a protease domain with a cysteine outside a protease domain. 

The first protease domain is characterized by the presence of a 
protease activation cleavage site (...R 46 i l 47 VGGSNAQP..., where i 

20 indicates protease activation cleavage site) at the beginning of the domain 
and catalytic triad residues (H 87 , D 139 and S 243 ) in 3 highly-conserved 
regions of the catalytic domain. In the second protease domain, the 
invariant catalytic histidine is replaced by a serine (S 363 ) residue, and the 
highly conserved SGGP sequence that contains the catalytic serine is 

25 replaced with the sequence S 510 RWS. In addition, PD2 begins at amino 
acid 323-324 (Pro 323 Glu 324 ), which provides an unusual cleavage site. 
Since Pro-Glu residues as a cleavage site is unusual, cleavage may not be 
needed for activation. These sequences and differences from other 
protease domains indicate that the second protease domain has lower 

30 catalytic activity. 
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Antigenic epitopes 

Antigenic epitopes that contain at least 4, 5, 6, 7, 8, 9, 10, 1 1, 
12, 13, 14, 15, 20, 25, 30, 40, 50, and typically 10-15 amino acids of 
the CVSP16 polypeptide are provided. These antigenic epitopes are used, 
5 for example, to raise antibodies. Antibodies, see further discussion 
below, specific for each epitope or combinations thereof are also 
provided. Also provided are antibodies that bind with at least 10-fold or 
100-fold greater affinity to CVSP16 polypeptides that do not include the 
sequence of amino acids set forth in SEQ ID No. 21 (particularly do not 

10 include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6) compared to those that include SEQ ID 
No. 21 . In particular, provided are antibodies that bind to a CVSP16 
polypeptide provided herein that does not include at least 5, 7, 9, 10, 15, 
20 contiguous amino acids of SEQ ID No. 21, particularly where the 

15 contiguous amino acids are inserted between amino acids corresponding 
to Q 660 and M 661 , with at least 2-, 5-, 10-, 100-fold greater affinity than to 
a polypeptide that includes the least 5 contiguous amino acids set forth in 
SEQ ID No. 21, particularly where the continuous amino acids are 
inserted between amino acids corresponding to Q 660 and M 661 . 

20 Muteins and derivatives of CVSP16 polypeptides 

Full-length CVSP16, zymogen and activated forms thereof and 
CVSP1 6 protease domains, portions thereof, and muteins and derivatives 
of such polypeptides are provided. Among the derivatives are those 
based on animal CVSP16s, including, but are not limited to, rodent, such 

25 as mouse and rat; fowl, such as chicken; ruminants, such as goats, 
cows, deer, sheep; ovine, such as pigs; and humans. For example, 
CVSP16 derivatives can be made by altering their sequences by 
substitutions, additions or deletions. CVSP16 derivatives include, but are 
not limited to, those containing, as a primary amino acid sequence, all or 

30 part of the amino acid sequence of CVSP16, including altered sequences 
in which functionally equivalent amino acid residues are substituted for 
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residues within the sequence resulting in a silent change. For example, 
one or more amino acid residues within the sequence can be substituted 
by another amino acid of a similar polarity which acts as a functional 
equivalent, resulting in a silent alteration. Substitutes for an amino acid 
5 within the sequence can be selected from other members of the class to 
which the amino acid belongs. For example, the nonpolar (hydrophobic) 
amino acids include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

10 glutamine. The positively charged (basic) amino acids include arginine, 
lysine and histidine. The negatively charged (acidic) amino acids include 
aspartic acid and glutamic acid (see, e.g., Table 1). Muteins of the 
CVSP16 or a domain thereof, such as a protease domain, in which up to 
about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90% or 

15 95% of the amino acids are replaced with another amino acid are 

provided. Generally such muteins retain at least about 1 %, 2%, 3,%, 
5%, 7%, 8%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 
95% or more (or in increased activity, i.e., 101, 102, 103, 104, 105, 
1 10% or greater) of the protease activity of the unmutated protein. 

20 Those of skill in the art recognize that a polypeptide that retains at least 
1 % of the activity of the wild-type protease is sufficiently active for use 
in screening assays or in other applications. 

CVSP1 6 derivatives can be made by altering their sequences by 
substitutions, additions or deletions that provide for functionally 

25 equivalent molecules. Due to the degeneracy of nucleotide coding 

sequences, other nucleic sequences which encode substantially the same 
amino acid sequence as a CVSP16 are provided. These include but are 
not limited to nucleotide sequences containing all or portions of CVSP16 
genes that are altered by the substitution of different codons that encode 

30 the amino acid residue within the sequence, thus producing a silent 
change. CVSP16 derivatives include, but are not limited to, those 
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containing, as a primary amino acid sequence, all or part of the amino 
acid sequence of SP, including altered sequences in which amino acid 
residues are substituted for residues within the sequence resulting in a 
silent change or a change that eliminates an activity. Substitutes for an 
5 amino acid within the sequence can be selected from other members of 
the class to which the amino acid belongs or from another class (for non- 
conservative changes). 

Muteins in which one or more of the Cys residues, particularly, a 
residue that is paired in the activated two-chain or three-chain form, but 

10 unpaired in the protease domain alone {i.e., a Cys at residue no. 159 

and/or 430 in SEQ ID No. 6) is/are replaced with any amino acid, typically, 
although not necessarily, a conservative amino acid residue, such as Ser, 
are contemplated. Muteins in which 10%, 20%, 30%, 35%, 40%, 45%, 
50% or more of the amino acids are replaced but the resulting polypeptide 

15 retains at least about 1%, 3%, 5%, 10%, 20%, 30%, 35%, 40%, 45%, 
50%, 60%, 70%, 80%, 90% or 95% of the catalytic activity (or other 
functional activity or other property, such as antigenicity or 
immunogenicity) as the unmodified form for the same substrate also are 
provided. 

20 Nucleic acid molecules, vectors and plasmids, cells and 

expression of CVSP16 polypeptides 

Also provided herein are nucleic acid molecules that encode SP 

proteins and the encoded proteins. In particular, nucleic acid molecules 

encoding CVSP16 from animals, including splice variants thereof are 

25 provided. The encoded proteins are also provided. Also provided are 
functional domains thereof. For example, the SP (CVSP16) protease 
domains, portions thereof, and muteins thereof are from or based on 
animal SPs (CVSPs), including, but are not limited to, rodent, such as 
mouse and rat; fowl, such as chicken; ruminants, such as goats, cows, 

30 deer, sheep; ovine, such as pigs; and humans. The isolated nucleic acid 
fragment is DNA, including genomic or cDNA, or is RNA, or can include 
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other components, such as PNA. The isolated nucleic acid can include 
additional components, such as heterologous or native promoters, and 
other transcriptional and translational regulatory sequences, these genes 
can be linked to other genes, such as reporter genes or other indicator 
5 genes or genes that encode indicators. 

Nucleic acid molecules 
Due to the degeneracy of nucleotide coding sequences, other nucleic 
acid sequences that encode substantially the same amino acid sequence as 
CVSP16 gene (or cDNA or RNA) can be used. These include but are not 

10 limited to nucleotide sequences comprising all or portions of CVSP16 genes 
that are altered by the substitution of different codons that encode the 
amino acid residue within the sequence, thus producing a silent change. 

Also provided are nucleic acid molecules that hybridize to the above- 
noted sequences of nucleotides encoding CVSP1 6 at least at low 

15 stringency, at moderate stringency, and/or at high stringency, and that 

encode the protease domain and/or the full length protein or other domains 
of a CVSP16 or a splice variant or allelic variant thereof. Generally the 
molecules hybridize under such conditions along their full length (or along 
at least about 70%, 80% or 90% of the full length) for at least one domain 

20 and encode at least one domain, such as the protease domain, of the 
polypeptide. In particular, such nucleic acid molecules include any 
isolated nucleic acid fragment that encodes at least one domain of a serine 
protease, that (1) contains a sequence of nucleotides that encodes the 
protease or a functionally active, such as catalytically active, domain 

25 thereof, and (2) is selected from among sequences of nucleic acids that 
encode a CVSP16 polypeptide: 

a) a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 5; 

b) a polypeptide encoded by a sequence of nucleotides that 

30 hybridizes under conditions of low, moderate or high stringency to the 
sequence of nucleotides set forth in SEQ ID No. 5, wherein the encoded 
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polypeptide does not include at least 5, 6, 7, 8, 9, 15 or 20 contiguous 
amino acids from SEQ ID No. 21 up to all of SEQ ID No. 21, and 
particularly do not include the contiguous amino acids therefrom between 
amino acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6; 
5 c) a polypeptide that contains the sequence of amino acids set forth 

in SEQ ID No. 6 or that contains amino acid residues 24-752 or 
functionally active, particularly, catalytically active, portions thereof; 

d) a polypeptide that contains a sequence of amino acids having at 
least about 60%, 70%, 80%, 90% or about 95% sequence identity with 

10 the sequence of amino acids set forth in SEQ ID No. 6, where the 

polypeptide does not include at least at least 5, 6, 7, 8, 9, 10, 15 or 20 
contiguous amino acids from SEQ ID No. 21 up to all of SEQ ID No. 21, 
and particularly do not include any amino acids therefrom between amino 
acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6; 

15 e) a polypeptide that contains at least two protease domains of a 

serine protease 16 (CVSP16) and includes at least 5 contiguous amino 
acids corresponding to residues 508-544 of SEQ ID No. 6. or contains the 
contiguous sequence Asn Asp Ser or Trp Asn Asp or Ser Cys Trp Asn Asp 
Ser or Cys Trp Asn Asp Ser or Gin Thr His or Leu Gin Thr His in the second 

20 protease domain; 

f) a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of at least moderate, and can be high, 
stringency along at least 70% of its full length to a sequence of 
nucleotides that encodes a polypeptide of any of a)-e), wherein the 

25 polypeptide does not include at least 5, 6, 7, 8, 9, 10, 15 or 20 

contiguous amino acids contiguous amino acids from SEQ ID No. 21 up to 
all of SEQ ID No. 21, and particularly does not include any amino acids 
therefrom between amino acids corresponding to Gln 660 and Met 661 of SEQ 
ID No. 6; 

30 g) the polypeptide has at least 60%, 70%, 80%, 90% or about 

95% sequence identity with a polypeptide of any of a)-), where the 
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polypeptide does not include at least about 5, 6, 7, 8, 9, 10, 15 or 20 
contiguous amino acids from SEQ ID No. 21 up to all of SEQ ID No. 21, 
and particularly do not include any amino acids therefrom between amino 
acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6; and/or 
5 h) a polypeptide encoded by a splice variant of a sequence of 

nucleotides that encodes a CVSP16 polypeptide, including a polypeptide of 
any of a)-f), as long as the resulting polypeptide the polypeptide does not 
include at least about 5, 6, 7, 8, 9, 10, 15 or 20 contiguous amino acids 
from SEQ ID No. 21 up to all of SEQ ID No. 21, and particularly does not 

10 include any amino acids therefrom between amino acids corresponding to 
Gln 660 and Met 661 of SEQ ID No. 6 and/or the polypeptide encoded by the 
splice variant contains at least two protease domains of a serine protease 
16 (CVSP16) and includes at least 5 contiguous amino acids corresponding 
to residues 508-544 of SEQ ID No. 6. or contains the contiguous sequence 

15 Asn Asp Ser or Trp Asn Asp or Ser Cys Trp Asn Asp Ser or Cys Trp Asn 
Asp Ser or Gin Thr His or Leu Gin Thr His in the second protease domain. 
Smaller nucleic acid molecules that encode polypeptides that retain 
protease activity as single chains or as other truncated single-chain forms 
are provided. 

20 Hence, among the nucleic acid molecules are those that do not 

encode a polypeptide that includes at least about 5, 6, 7, 8, 9, 10, 15 or 
20 contiguous amino acids from SEQ ID No. 21 up to all of SEQ ID No. 
21, and particularly do not include any amino acids therefrom between 
amino acids corresponding to Gln 660 and Met 661 of SEQ ID No. 6 and 

25 contain: 

(a) a sequence of nucleotides that encodes the CVSP1 6 
polypeptide or a domain thereof, particularly a CVSP16 
polypeptide of a)-h) above; 

(b) a sequence of nucleotides that encodes such portion or the 
30 full length CVSP16 protease and hybridizes under conditions 

of moderate or high stringency, generally to nucleic acid that 
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is complementary to an mRNA transcript present in a 
mammalian cell that encodes a mammalian CVSP16 protein or 
a fragment thereof; 

(c) a sequence of nucleotides that encodes a CVSP1 6 protease or 
5 a domain thereof that includes a sequence of amino acids 

encoded by such portion or the full length open reading frame; 
and 

(d) a sequence of nucleotides that encodes a CVSP16 protease 
that includes a sequence of amino acids encoded by a 

10 sequence of nucleotides that encodes such protease and 

hybridizes under conditions of moderate or high stringency to 
DNA that is complementary to a mRNA transcript. 

(e) a sequence of nucleotides that includes degenerate codons of 
all or a portion of any of (a)-(d). 

15 The isolated nucleic acid fragment is DNA, including genomic or 

cDNA, or is RNA, or can include other components, such as peptide 
nucleic acid (PNA) or other modified nucleotides or ribonucleotides or 
analogs thereof. The isolated nucleic acid can include additional 
components, such as heterologous or native promoters, and other 

20 transcriptional and translational regulatory sequences, these genes can be 
linked to other genes, such as reporter genes or other indicator genes or 
genes that encode indicators. 

In an exemplary embodiment, a nucleic acid molecule that encodes a 
CVSP16 polypeptide is provided. In particular, an encoding nucleic acid 

25 molecule with an open reading frame within the sequence of nucleotides 
set forth in SEQ ID No. 5 is provided. The encoded protein is set forth in 
SEQ ID NO. 6 (see, also EXAMPLE 1). Also provided a nucleic acid 
molecules that encode the mature polypeptide (residues 24-752) and one 
or both protease domains. The encoded polypeptide does not include at 

30 least about 5, 6, 7, 8, 9, 10, 15 or 20 contiguous amino acids from SEQ 
ID No. 21 up to all of SEQ ID No. 21, and particularly do not include any 
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amino acids therefrom between amino acids corresponding to Gln 660 and 
Met 661 of SEQ ID No. 6. 

Also provided are nucleic acid molecules that hybridize under 
conditions of at least low stringency, typically moderate stringency, and 
5 generally high stringency, and typically along at least 70%, 80%, 90% or 
95% of their full length to the above sequence of nucleic acids (SEQ ID 
No. 5 or degenerates thereof), particularly to the open reading frame 
encompassed by nucleotides that encode a single protease domain or any 
domain of CVSP16, and/or to any of the CVSP16 polypeptides and 

10 fragments thereof as described herein. 

Also provided are nucleic acid molecules that encode a single chain 
CVSP16 protease that has proteolytic activity in an in vitro proteolysis 
assay and that encode a polypeptide that has at least 80%, 85%, 90% or 
95% sequence identity with the full length CVSP16 polypeptide of SEQ ID 

15 No. 6. 

Also provided are nucleic acid molecules that encode a single chain 
CVSP16 protease that has proteolytic activity in an in vitro proteolysis 
assay and that encode a polypeptide that has at least 60%, 70%, 80%, 
85%, 90% or 95% sequence identity with the full length of a protease 

20 domain of the CVSP16 polypeptide of SEQ ID No. 6, or that hybridize 

along their full length to a nucleic acids that encodes a protease domain, 
particularly under conditions of moderate, generally high, stringency. As 
above, the encoded polypeptides contain the protease as a single chain. 
The isolated nucleic acids can include at least 8 nucleotides of a 

25 CVSP1 6-encoding sequence. In other embodiments, the nucleic acids can 
contain at least 10, 14, 16, 50, 100 nucleotides, 150 nucleotides, or 200 
nucleotides of a CVSP1 6-encoding sequence provided that the nucleic acid 
molecules includes codons encoding Q660 and M661 as contiguous amino 
acids, particular those encoding amino acids 655-665 or smaller portions 

30 thereof that include Q 6SO and M 66 i of SEQ ID No. 6. The nucleic acid 
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molecules generally include sequence of nucleotides that encode residues 
that correspond to Q 660 and M 661 of SEQ ID No. 6. 

For each of the nucleic acid molecules, the nucleic acid can be DNA 
or RNA or PNA or other nucleic acid analogs or can include non-natural 
5 nucleotide bases. Also provided are isolated nucleic acid molecules that 
include a sequence of nucleotides complementary to the nucleotide 
sequence encoding a SP. 

Probes, primers, antisense oligonucleotides and dsRNA 
Also provided are fragments thereof or oligonucleotides that can be 

10 used as probes or primers and that contain at least about 10, 14, 16 
nucleotides, generally less than 1000 or less than or equal to 100, set 
forth in SEQ ID No. 5 (or the complement thereof); or contain at least 
about 30 nucleotides (or the complement thereof) or contain 
oligonucleotides that hybridize along their full length or along at least about 

15 70%, 80% or 90% of their full length to any such fragments or 

oligonucleotides. The length of the fragments is a function of the purpose 
for which they are used and/or the complexity of the genome of interest. 
Generally probes and primers contain less than about 500, 150, 100 
nucleotides. The probes and primers generally span residues that 

20 correspond to Q 660 and M 661 . 

Probes and primers derived from the nucleic acid molecules are 
provided. Such probes and primers contain at least 8, 14, 16, 30, 100 or 
more contiguous nucleotides with identity to contiguous nucleotides of a 
CVSP16, particularly those that span nucleotides corresponding to 1978- 

25 1983 of SEQ ID No. 5. The probes and primers are optionally labelled with 
a detectable label, such as a radiolabel or a fluorescent tag, or can be mass 
differentiated for detection by mass spectrometry or other means. 

Also provided are fragments thereof or oligonucleotides that can be 
used as probes or primers and that contain at least about 10, 14, 16 

30 nucleotides, generally less than 1000 or less than or equal to 100, set 

forth in SEQ ID No. 5 (or the complement thereof), particularly those that 
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span nt. 1978-1983; or contain at least about 30 nucleotides (or the 
complement thereof) or contain oligonucleotides that hybridize along their 
full length (or at least about 70, 80 or 90% thereof) to any such fragments 
or oligonucleotides. The length of the fragments are a function of the 
5 purpose for which they are used and/or the complexity of the genome of 
interest. Generally probes and primers contain less than about 500, 1 50, 
100 nucleotides. 

Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules that is complementary to the nucleotide sequence 

10 encoding CVSP16 or the portion thereof. Double-stranded RNA (dsRNA), 
such as RNAi also is provided. 

Plasmids, vectors and cells 
Plasmids and vectors containing the nucleic acid molecules are also 
provided. Cells containing the vectors, including cells that express the 

15 encoded proteins are provided. The cell can be a bacterial cell, a yeast 

cell, a fungal cell, a plant cell, an insect cell or an animal cell. Methods for 
producing a SP or single chain form of the protease domain thereof by, for 
example, growing the cell under conditions whereby the encoded SP is 
expressed by the cell, and recovering the expressed protein, are provided 

20 herein. As noted, for CVSP16, the full-length zymogens and activated 
proteins and activated (two- or three-chain) protease and single chain 
protease domains are provided. 

As discussed below, the CVSP16 polypeptide, and catalytically 
active portions thereof, can be expressed as a secreted protein using the 

25 native signal sequence or a heterologous signal. Alternatively the protein 
can be expressed as inclusion bodies in the cytoplasm and isolated 
therefrom. The resulting protein can be treated to refold (see, e.g., 
EXAMPLE 1). Active protease domain can be produced by expression in 
inclusion bodies, isolation therefrom and denaturation followed by 

30 refolding. 
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C. Tumor specificity and tissue expression profiles 

Each SP has a characteristic tissue expression profile; the SPs in 
particular, although not exclusively expressed or activated in tumors, 
exhibit characteristic tumor tissue expression or activation profiles. In 
5 some instances, SPs can have different activity in a tumor cell from a non- 
tumor cell by virtue of a change in a substrate or cofactor therefor or other 
factor that would alter the apparent functional activity of the SP. Hence 
each can serve as a diagnostic marker for particular tumors, by virtue of a 
level of activity and/or expression or function in a subject (i.e.a mammal, 

10 particularly a human) with neoplastic disease, compared to a subject or 

subjects that do not have the neoplastic disease. In addition, detection of 
activity (and/or expression) in a particular tissue can be indicative of 
neoplastic disease. 

Circulating CSVP16s in body fluids can be indicative of neoplastic 

15 disease. Secreted CVSP16 or activated CVSP16 can be indicative of 

neoplastic disease. Also, by virtue of an activity and/or expression profile 
of each, CVP16s can serve as therapeutic targets, such as by 
administration of modulators of an activity thereof, or, as by administration 
of a prodrug specifically activated by one of the CVSP16s. 

20 Tissue expression profiles 

CVSP16 

Results indicate that the CVSP16 transcript is strongly expressed in 
several tissues including kidney, stomach, colon, spleen, thyroid gland, 
trachea and pituitary gland. The CVSP16 transcript also is found in many 

25 other tissues at a lower level. Among tumor cell lines, the CVSP16 

transcript occurs, for example, in cervical Hela S3, lung A549, leukemia K- 
562, Burkitt's lymphoma Raji, leukemia HL-60, colorectal SW480, Burkitt's 
lymphoma Daudi and leukemia MOLT-4. The CVSP16 transcript also was 
detected (in decreasing signal intensity) in normal breast, normal prostate, 

30 breast carcinoma cell line DU4475, prostate carcinoma cell line PC-3, 
prostate carcinoma cell line LNCaP, breast carcinoma cell line MDA-MB- 
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231, and breast carcinoma cell line MDA-MB-453. Tissue expression 

profiles are described in Example 1 . 

D. Identification and isolation of SP protein genes 

The SP polypeptides, including CVSP1 6 polypeptides, or domains 
5 thereof, can be obtained by methods well known in the art for protein 
purification and recombinant protein expression. Any method known to 
those of skill in the art for identification of nucleic acids that encode 
desired genes can be used. Any method available in the art can be used to 
obtain a full length (i.e., encompassing the entire coding region) cDNA or 

10 genomic DNA clone encoding a SP protein. In particular, the polymerase 
chain reaction (PCR) can be used to amplify a sequence identified as being 
differentially expressed or encoding proteins activated at different levels in 
tumor and non-tumor cells or tissues, e.g., nucleic acids encoding a 
CVSP16 polypeptide (SEQ. NOs: 5, 6, 12 and 13), in a genomic or cDNA 

15 library. Oligonucleotide primers that hybridize to sequences at the 3' and 
5' termini of the identified sequences can be used as primers to amplify by 
PCR sequences from a nucleic acid sample (RNA or DNA), typically a cDNA 
library, from an appropriate source (e.g., tumor or cancer tissue). 

Amplification, such as PCR, can be carried out by a thermal cycler 

20 and thermostable DNA polymerase. The nucleic acid that is amplified can 
include mRNA or cDNA or genomic DNA from any eukaryotic species. One 
can choose to synthesize several different degenerate primers, for use in 
the PCR reactions. It also is possible to vary the stringency of 
hybridization conditions used in priming the PCR reactions, to amplify 

25 nucleic acid orthologs or homologs (e.g., to obtain SP protein sequences 
from species other than humans or to obtain human sequences with 
homology to CVSP16 polypeptide) by allowing for greater or lesser degrees 
of nucleotide sequence similarity between the known nucleotide sequence 
and the nucleic acid homolog being isolated. For cross species 

30 hybridization, low or moderate stringency conditions are used. For same 
species hybridization, moderate or high stringency conditions generally are 



-77- 



used. After successful amplification of the nucleic acid containing all or a 
portion of the identified CVSP16 protein sequence or of a nucleic acid 
encoding all or a portion of a CVSP16 protein homolog, that segment can 
be molecularly cloned and sequenced, and used as a probe to isolate a 
5 complete cDNA or genomic clone. This, in turn, permits the determination 
of the gene's complete nucleotide sequence, the analysis of its expression, 
and the production of its protein product for functional analysis. Once the 
nucleotide sequence is determined, an open reading frame encoding the SP 
protein gene protein product can be determined by any method well known 

10 in the art for determining open reading frames, for example, using publicly 
available computer programs for nucleotide sequence analysis. Once an 
open reading frame is defined, it is routine to determine the amino acid 
sequence of the protein encoded by the open reading frame. In this way, 
the nucleotide sequences of the entire SP protein genes as well as the 

15 amino acid sequences of SP proteins and analogs can be identified. 

Any eukaryotic cell potentially can serve as the nucleic acid source 
for the molecular cloning of the SP protein gene. The nucleic acids can be 
isolated from vertebrate, mammalian, human, porcine, bovine, feline, avian, 
equine, canine, as well as additional primate sources, insects, plants, and 

20 other sources. The DNA can be obtained by standard procedures known in 
the art from cloned DNA (e.g., a DNA "library"), by chemical synthesis, by 
cDNA cloning, or by the cloning of genomic DNA, or fragments thereof, 
purified from the desired cell (see, e.g., Sambrook et al. (2001) Molecular 
Cloning: A Laboratory Manual (3rd Ed.), Vol. 1-3, Cold Spring Harbor 

25 Laboratory, Cold Spring Harbor, N.Y.). Clones derived from genomic DNA 
can contain regulatory and intron DNA regions in addition to coding 
regions; clones derived from cDNA contains only exon sequences. 
Whatever the source, the gene should be molecularly cloned into a suitable 
vector for propagation of the gene. 

30 In the molecular cloning of the gene from genomic DNA, DNA 

fragments are generated, some of which encode the desired gene. The 
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DNA can be cleaved at specific sites using various restriction enzymes. 
Alternatively, one can use DNAse in the presence of manganese to 
fragment the DNA, or the DNA can be physically sheared, for example, by 
sonication. The linear DNA fragments can then be separated according to 
5 size by standard techniques, including but not limited to, agarose and 
polyacrylamide gel electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific 
DNA fragment containing the desired gene can be accomplished in a 
number of ways. For example, a portion of the SP protein (of any species) 

10 gene (e.g. , a PCR amplification product obtained as described above or an 
oligonucleotide having a sequence of a portion of the known nucleotide 
sequence) or its specific RNA, or a fragment thereof can be purified and 
labeled, and the generated DNA fragments can be screened by nucleic acid 
hybridization to the labeled probe (Benton and Davis, Science 796:180 

15 (1977); Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 72:3961 
(1975)). Those DNA fragments with substantial homology to the probe 
hybridize. It also is possible to identify the appropriate fragment by 
restriction enzyme digestion(s) and comparison of fragment sizes with 
those expected according to a known restriction map if such is available or 

20 by DNA sequence analysis and comparison to the known nucleotide 

sequence of SP protein. Further selection can be carried out on the basis 
of the properties of the gene. Alternatively, the presence of the gene can 
be detected by assays based on the physical, chemical, or immunological 
properties of its expressed product. For example, cDNA clones, or DNA 

25 clones which hybrid-select the proper mRNA, can be selected which 
produce a protein that, e.g., has similar or identical electrophoretic 
migration, isoelectric focusing behavior, proteolytic digestion maps, 
antigenic properties, serine protease activity. If an anti-SP protein antibody 
is available, the protein can be identified by binding of labeled antibody to 

30 the putatively SP protein synthesizing clones, in an ELISA (enzyme-linked 
immunosorbent assayMype procedure. 



-79- 



Alternatives to isolating the CVSP16 polypeptide genomic DNA 
include, but are not limited to, chemically synthesizing the gene sequence 
from a known sequence or making cDNA to the mRNA that encodes the SP 
protein. For example, RNA for cDNA cloning of the SP protein gene can be 
5 isolated from cells expressing the protein. The identified and isolated 
nucleic acids can then be inserted into an appropriate cloning vector. A 
large number of vector-host systems known in the art can be used. 
Possible vectors include, but are not limited to, plasmids or modified 
viruses, but the vector system must be compatible with the host cell used. 

10 Such vectors include, but are not limited to, bacteriophages such as 
lambda derivatives, or plasmids such as pBR322 or pUC plasmid 
derivatives or the Bluescript vector (Stratagene, La Jolla, CA). The insertion 
into a cloning vector can, for example, be accomplished by ligating the 
DNA fragment into a cloning vector which has complementary cohesive 

15 termini. Insertion can be effected using TOPO cloning vectors 

(INVITROGEN, Carlsbad, CA). If the complementary restriction sites used 
to fragment the DNA are not present in the cloning vector, the ends of the 
DNA molecules can be enzymatically modified. Alternatively, any site 
desired can be produced by ligating nucleotide sequences (linkers) onto the 

20 DNA termini; these ligated linkers can contain specific chemically 

synthesized oligonucleotides encoding restriction endonuclease recognition 
sequences. In an alternative method, the cleaved vector and SP protein 
gene can be modified by homopolymeric tailing. Recombinant molecules 
can be introduced into host cells via, for example, transformation, 

25 transfection, infection, electroporation and sonorporation, so that many 
copies of the gene sequence are generated. 

In specific embodiments, transformation of host cells with 
recombinant DNA molecules that incorporate the isolated SP protein gene, 
cDNA, or synthesized DNA sequence enables generation of multiple copies 

30 of the gene. Thus, the gene can be obtained in large quantities by growing 
transformants, isolating the recombinant DNA molecules from the 
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transformants and, when necessary, retrieving the inserted gene from the 
isolated recombinant DNA. 

E. Vectors, plasmids and cells that contain nucleic acids encoding a SP 
protein or protease domain thereof and expression of SP proteins 

5 Vectors and cells 

For recombinant expression of one or more of the SP proteins, the 
nucleic acid containing all or a portion of the nucleotide sequence encoding 
the SP protein can be inserted into an appropriate expression vector, i.e., a 
vector that contains the necessary elements for the transcription and 

10 translation of the inserted protein coding sequence. The necessary 

transcriptional and translational signals also can be supplied by the native 
promoter for SP genes, and/or their flanking regions. 

Also provided are vectors that contain nucleic acid encoding the 
SPs. Cells containing the vectors are also provided. The cells include 

15 eukaryotic and prokaryotic cells, and the vectors are any suitable for use 
therein. 

Prokaryotic and eukaryotic cells, including endothelial cells, 
containing the vectors are provided. Such cells include bacterial cells, 
yeast cells, fungal cells, Archea, plant cells, insect cells and animal cells. 

20 The cells are used to produce a SP protein or protease domain thereof by 
growing the above-described cells under conditions whereby the encoded 
SP protein or protease domain of the SP protein is expressed by the cell, 
and recovering the expressed protease domain protein. For purposes 
herein, the protease domain can be secreted into the medium. 

25 In one embodiment, the vectors include a sequence of nucleotides 

that encodes a polypeptide that has protease activity and contains all or a 
portion of the protease domain, or multiple copies thereof, of a SP protein 
are provided. Also provided are vectors that contain a sequence of 
nucleotides that encodes the protease domain and additional portions of a 

30 SP protein up to and including a full length SP protein, as well as multiple 
copies thereof, are also provided. The vectors can be selected for 
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expression of the SP protein or protease domain thereof in the cell or such 
that the SP protein is expressed as a secreted protein. When the 
protease domain is expressed the nucleic acid is linked to nucleic acid 
encoding a secretion signal, such as the Saccharomyces cerevisiae a 
5 mating factor signal sequence or a portion thereof, or the native signal 
sequence. 

A variety of host-vector systems can be used to express the protein 
coding sequence. These include but are not limited to mammalian cell 
systems infected with virus (e.g. vaccinia virus, adenovirus and other 

10 viruses); insect cell systems infected with virus (e.g. baculovirus); 
microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The 
expression elements of vectors vary in their strengths and specificities. 
Depending on the host-vector system used, any one of a number of 

15 suitable transcription and translation elements can be used. 

Any methods known to those of skill in the art for the insertion of 
DNA fragments into a vector can be used to construct expression vectors 
containing a chimeric gene containing appropriate 

transcriptional/translational control signals and protein coding sequences. 

20 These methods can include in vitro recombinant DNA and synthetic 

techniques and in vivo recombinants (genetic recombination). Expression 
of nucleic acid sequences encoding SP protein, or domains, derivatives, 
fragments or homologs thereof, can be regulated by a second nucleic acid 
sequence so that the genes or fragments thereof are expressed in a host 

25 transformed with the recombinant DNA molecule(s). For example, 

expression of the proteins can be controlled by any promoter/enhancer 
known in the art. In a specific embodiment, the promoter is not native to 
the genes for SP protein. Promoters which can be used include but are not 
limited to the SV40 early promoter (Bernoist and Chambon, Nature 

30 250:304-310 (1981)), the promoter contained in the 3' long terminal 

repeat of Rous sarcoma virus (Yamamoto et al. Ce/t 22:787-797 (1980)), 
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the herpes thymidine kinase promoter (Wagner et al., Proc. Natl. Acad. Sci. 
USA 73:1441-1445 (1981)), the regulatory sequences of the 
metailothionein gene (Brinster et aL, Nature 296:39-42 (1982)); 
prokaryotic expression vectors such as the /?-lactamase promoter (Villa- 
5 Kamaroff et aL, Proc. Natl. Acad. Sci. USA 75:3727-3731 1978)) or the 
tac promoter (DeBoer et al., Proc. Natl. Acad. Sci. USA 50:21-25 (1983)); 
see also "Useful Proteins from Recombinant Bacteria": in Scientific 
American 242:79-94 (1980)); plant expression vectors containing the 
nopaline synthetase promoter (Herrar-Estrella et al., Nature 303:209-213 

10 (1984)) or the cauliflower mosaic virus 35S RNA promoter (Garder et al., 
Nucleic Acids Res. 9:2871 (1 981 )), and the promoter of the 
photosynthetic enzyme ribulose bisphosphate carboxylase (Herrera-Estrella 
et al., Nature 370:115-120 (1984)); promoter elements from yeast and 
other fungi such as the Gal4 promoter, the alcohol dehydrogenase 

15 promoter, the phosphoglycerol kinase promoter, the alkaline phosphatase 
promoter, and the following animal transcriptional control regions that 
exhibit tissue specificity and have been used in transgenic animals: 
elastase I gene control region which is active in pancreatic acinar cells 
(Swift et al., Cell 33:639-646 (1 984); Ornitz et al., Cold Spring Harbor 

20 Symp. Quant. Biol. 50:399-409 (1986); MacDonald, Hepatology 7:425- 
515 (1987)); insulin gene control region which is active in pancreatic beta 
cells (Hanahan et al., Nature 3/5:1 15-122 (1985)), immunoglobulin gene 
control region which is active in lymphoid cells (Grosschedl et al., Cell 
33:647-658 (1984); Adams et al., Nature 3/3:533-538 (1985); Alexander 

25 et aL, Mol. Cell Biol. 7:1436-1444 (1987)), mouse mammary tumor virus 
control region which is active in testicular, breast, lymphoid and mast cells 
(Leder et al., Cell 45:485-495 (1986)), albumin gene control region which 
is active in liver (Pinckert et aL, Genes and Devel. /:268-276 (1 987)), 
alpha-fetoprotein gene control region which is active in liver (Krumlauf et 

30 aL, Mol. Cell. Biol. 5:1639-1648 (1985); Hammer eta/., Science 235:53- 
58 1987)), alpha-1 antitrypsin gene control region which is active in liver 
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(Kelsey eta/., Genes and Devef. 7:161-171 (1987)), beta globin gene 
control region which is active in myeloid cells (Mogram et al. f Nature 
375:338-340 (1985); Kollias etaL, Cell 46:89-94 (1986)), myelin basic 
protein gene control region which is active in oligodendrocyte cells of the 
5 brain (Readhead et aL, Cell 45:703-71 2 (1987)), myosin light chain-2 gene 
control region which is active in skeletal muscle (Sani, Nature 314:283- 
286 (1985)), and gonadotrophic releasing hormone gene control region 
which is active in gonadotrophs of the hypothalamus (Mason etaL, 
Science 234:1372-1378 (1986)). 

10 In a specific embodiment, a vector is used that contains a promoter 

operably linked to nucleic acids encoding a SP protein, or a domain, 
fragment, derivative or homolog, thereof, one or more origins of replication, 
and optionally, one or more selectable markers (e.g. , an antibiotic 
resistance gene). Expression vectors containing the coding sequences, or 

15 portions thereof, of a SP protein, are made, for example, by subcloning the 
coding portions into the EcoRI restriction site of each of the three pGEX 
vectors (glutathione S-transferase expression vectors (Smith and Johnson, 
Gene 7:31-40 (1988)). This allows for the expression of products in the 
correct reading frame. Vectors and systems for expression of the 

20 protease domains of the SP proteins include the well known Pichia vectors 
(available, for example, from Invitrogen, San Diego, CA), particularly those 
designed for secretion of the encoded proteins. One exemplary vector is 
described in the EXAMPLES. 

Plasmids for transformation of E. coli cells, include, for example, the 

25 pET expression vectors (see, U.S patent 4,952,496; available from 
NOVAGEN, Madison, Wl; see, also literature published by Novagen 
describing the system). Such plasmids include pET 11a, which contains 
the T7lac promoter, T7 terminator, the inducible E. coli lac operator, and 
the lac repressor gene; pET 12a-c, which contains the T7 promoter, T7 

30 terminator, and the E. coli ompT secretion signal; and pET 15b and pET19b 
(NOVAGEN, Madison, Wl), which contain a His-Tag™ leader sequence for 
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use in purification with a His column and a thrombin cleavage site that 
permits cleavage following purification over the column, the T7-lac 
promoter region and the T7 terminator. 

The vectors are introduced into host cells, such as Pichia cells and 
5 bacterial cells, such as E. co/i, and the proteins expressed therein. Pichia 
strains, which are known and readily available, include, for example, 
GS1 15. Bacterial hosts can contain chromosomal copies of DNA encoding 
T7 RNA polymerase operably linked to an inducible promoter, such as the 
lacUV promoter (see, U.S. Patent No. 4,952,496). Such hosts include, but 

10 are not limited to, the lysogenic E. co/i strain BL21{DE3). 

Expression and production of proteins 
The CVSP16 domains, derivatives and analogs can be produced by 
various methods known in the art. For example, once a recombinant cell 
expressing a SP protein, or a domain, fragment or derivative thereof, is 

15 identified, the individual gene product can be isolated and analyzed. This 
is achieved by assays based on the physical and/or functional properties of 
the protein, including, but not limited to, radioactive labeling of the product 
followed by analysis by gel electrophoresis, immunoassay, cross-linking to 
marker-labeled product. 

20 The CVSP1 6 polypeptides can be isolated and purified by standard 

methods known in the art (either from natural sources or recombinant host 
cells expressing the complexes or proteins), including but not restricted to 
column chromatography (e.g., ion exchange, affinity, gel exclusion, 
reversed-phase high pressure and fast protein liquid), differential 

25 centrifugation, differential solubility, or by any other standard technique 
used for the purification of proteins. Functional properties can be 
evaluated using any suitable assay known in the art. 

Once a SP protein or its domain or derivative is identified, the amino 
acid sequence of the protein can be deduced from the nucleotide sequence 

30 of the gene which encodes it. In addition, domains, analogs and 

derivatives of a SP protein can be chemically synthesized by standard 
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chemical methods known in the art (e.g. see Hunkapiller et al. (1984) 
Nature 3/0:105-111). For example, a peptide corresponding to a portion 
of a SP protein, which includes the desired domain or which mediates the 
desired activity in vitro can be synthesized by use of a peptide synthesizer. 
5 Furthermore, if desired, nonclassical amino acids or chemical amino acid 
analogs can be introduced as a substitution or addition into the SP protein 
sequence. Non-classical amino acids include but are not limited to the D- 
isomers of the common amino acids, a-amino isobutyric acid, 4- 
aminobutyric acid, Abu, 2-aminobutyric acid, e-Abu, e-Ahx, 6-amino 

10 hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionoic acid, 
ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, 
cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, 
cyclohexylalanine, B-alanine, fluoro-amino acids, designer amino acids such 
as fc-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, 

15 and amino acid analogs in general. The amino acids can be D (dextro- 
rotatory) or L (levorotatory). 

Manipulations of SP protein sequences can be made at the protein 
level. Also contemplated herein are SP proteins, domains thereof, 
derivatives or analogs or fragments thereof, which are differentially 

20 modified during or after translation, e.g. , by glycosylation, acetylation, 
phosphorylation, amidation, derivatization by known protecting/blocking 
groups, proteolytic cleavage, linkage to an antibody molecule or other 
cellular ligand. Any of numerous chemical modifications can be carried out 
by known techniques, including but not limited to specific chemical 

25 cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 
protease, NaBH 4 , acetylation, formylation, oxidation, reduction and 
metabolic synthesis in the presence of tunicamycin. 
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In cases where natural products are suspected of having a mutation 
or are isolated from new species, the amino acid sequence of the SP 
protein isolated from the natural source, as well as those expressed in 
vitro, or from synthesized expression vectors in vivo or in vitro, can be 
5 determined from analysis of the DNA sequence, or alternatively, by direct 
sequencing of the isolated protein. Such analysis can be performed by 
manual sequencing or through use of an automated amino acid sequenator. 

In particular, the protease domain (or full length or other fragment) 
of the CVSP16 can be expressed intracellular^ without a signal sequence, 
10 which results in accumulation or formation of inclusion bodies containing 
the protease domain. The inclusion bodies are isolated, denatured, 
solubilized and refolded to yield a protease domain (or full length or other 
fragment), which is then activated by cleavage at the activation cleavage 
site. 

15 Modifications 

A variety of modifications of the SP proteins and domains are 
contemplated herein. A SP-encoding nucleic acid molecule can be modified 
by any of numerous strategies known in the art (Sambrook et al. (1989) 
Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor 

20 Laboratory, Cold Spring Harbor, New York, and Sambrook et al. (2001) 
Molecular Cloning: A Laboratory Manual (3rd Ed.), Vol. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y.). The sequences can be 
cleaved at appropriate sites with restriction endonuclease(s), followed by 
further enzymatic modification if desired, isolated, and ligated in vitro. In 

25 the production of the gene encoding a domain, derivative or analog of SP, 
care should be taken to ensure that the modified gene retains the original 
translational reading frame, uninterrupted by translational stop signals, in 
the gene region where the desired activity is encoded. 

Additionally, the SP-encoding nucleic acid molecules can be mutated 

30 in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
termination sequences, or to create variations in coding regions and/or 
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form new restriction endonuclease sites or destroy pre-existing ones, to 
facilitate further in vitro modification. Also, as described herein muteins 
with primary sequence alterations, such as replacements of Cys residues 
and elimination of glycosylation sites are contemplated. Such mutations 
5 can be effected by any technique for mutagenesis known in the art, 
including, but not limited to, chemical mutagenesis and in vitro site- 
directed mutagenesis (Hutchinson et al., J. Biol. Chem. 253:6551-6558 
(1978)), use of TAB® linkers (Pharmacia). In one embodiment, for 
example, a SP protein or domain thereof is modified to include a 

10 fluorescent label. In other specific embodiments, the SP protein is modified 
to have a heterofunctional reagent, which can be used to crosslink the 
members of the complex. 
F. Screening methods 

The single chain protease domains can be used in a variety of 

15 methods to identify compounds that modulate the activity thereof. For SPs 
that exhibit higher activity or expression in tumor cells, compounds that 
inhibit the proteolytic activity are of particular interest. For any SPs that 
are active at lower levels in tumor cells, compounds or agents that 
enhance the activity are potentially of interest. In all instances the 

20 identified compounds include agents that are candidate cancer treatments. 

Several types of assays are described herein. It is understood that 
the protease domains can be used in other assays. A single-chain protease 
domain can exhibit catalytic activity. As such the protease domains are 
useful for in vitro screening assays, including, for example in binding 

25 assays. 

The CVSP16 full length zymogens, activated enzymes, single-, two- 
and three-chain protease domains are contemplated for use in any 
screening assay known to those of skill in the art, including those provided 
herein. Hence the following description, if directed to proteolytic assays is 
30 intended to apply to use of an active single-chain protease domain or a 
catalytically active portion thereof. Other assays, such as binding assays 
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are provided herein, particularly for use with a CVSP16, including any 

variants, such as splice variants thereof. 

1 . Catalytic Assays for identification of agents that modulate the 
protease activity of a SP protein 

5 Methods for identifying a modulator of the catalytic activity of a SP, 

particularly a single chain protease domain or catalytically active portion 

thereof, are provided herein. The methods can be practiced by: a) 

contacting the CVSP16, a full-length zymogen or activated form, and 

particularly a single-chain domain thereof, with a substrate of the CVSP16 

10 in the presence of a test substance, and detecting the proteolysis of the 
substrate, whereby the activity of the CVSP1 6 is assessed, and comparing 
the activity to a control. For example, the control can be the activity of 
the CVSP16 assessed by contacting a CVSP16, including a full-length 
zymogen or activated form, and particularly a single-chain domain thereof, 

15 with a substrate of the CVSP16, and detecting the proteolysis of the 

substrate, whereby the activity of the CVSP16 is assessed. The results in 
the presence and absence of the test compounds are compared. A 
difference in the activity indicates that the test substance modulates the 
activity of the CVSP16. Modulators, such as activators, of CVSP16 

20 activity or activation also are contemplated; such assays are discussed 
below. 

In one embodiment a plurality of the test substances are screened 
simultaneously in the above screening method. In another embodiment, 
the CVSP16 is isolated from a target cell as a means for then identifying 

25 agents that are potentially specific for the target cell. 

In another embodiment, a test substance is potentially a therapeutic 
compound. A difference in CVSP16 activity is measured in the presence 
and in the absence of the test substance. A difference indicates that the 
target cell responds to the compound. 

30 One method includes the steps of (a) contacting the CVSP16 

polypeptide or protease domain thereof with one or a plurality of test 
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compounds under conditions conducive to interaction between the ligand 
and the compounds; and (b) identifying one or more compounds in the 
plurality that specifically binds to the ligand. 

Another method provided herein includes the steps of a) contacting 
5 a CVSP16 polypeptide or protease domain thereof with a substrate of the 
CVSP1 6 polypeptide, and detecting the proteolysis of the substrate, 
whereby the activity of the CVSP1 6 polypeptide is assessed; b) contacting 
the CVSP16 polypeptide with a substrate of the CVSP16 polypeptide in the 
presence of a test substance, and detecting the proteolysis of the 

10 substrate, whereby the activity of the CVSP16 polypeptide is assessed; 

and c) comparing the activity of the CVSP16 polypeptide assessed in steps 
a) and b), whereby the activity measured in step a) differs from the activity 
measured in step b) indicates that the test substance modulates the 
activity of the CVSP16 polypeptide. 

15 In another embodiment, a plurality of the test substances are 

screened simultaneously. In comparing the activity of a CVSP16 
polypeptide in the presence and absence of a test substance to assess 
whether the test substance is a modulator of the CVSP16 polypeptide, it is 
unnecessary to assay the activity in parallel, although such parallel 

20 measurement is typical. It is possible to measure the activity of the 

CVSP1 6 polypeptide at one time point and compare the measured activity 
to a historical value of the activity of the CVSP16 polypeptide. 

For instance, one can measure the activity of the CVSP16 
polypeptide in the presence of a test substance and compare with historical 

25 value of the activity of the CVSP1 6 polypeptide measured previously in the 
absence of the test substance, and wee versa. This can be accomplished, 
for example, by providing the activity of the CVSP16 polypeptide on an 
insert or pamphlet provided with a kit for conducting the assay. Methods 
for selecting substrates for a particular SP are described in the EXAMPLES, 

30 and particular proteolytic assays are described. 
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Combinations and kits containing the combinations optionally 
including instructions for performing the assays are provided. The 
combinations include a CVSP16 polypeptide and a substrate of the 
CVSP1 6 polypeptide to be assayed; and, optionally reagents for detecting 
5 proteolysis of the substrate. The substrates, which can be chromogenic or 
fluorogenic molecules, including proteins, subject to proteolysis by a 
particular CVSP1 6 polypeptide, can be identified empirically by testing the 
ability of the CVSP1 6 polypeptide to cleave the test substrate. Substrates 
that are cleaved most effectively (i.e., at the lowest concentrations and/or 
10 fastest rate or under desirable conditions), are identified. 

Additionally provided herein is a kit containing the above-described 
combination. The kit optionally includes instructions for identifying a 
modulator of the activity of a CVSP16 polypeptide. Any CVSP16 
polypeptide is contemplated as target for identifying modulators of the 
15 activity thereof. 

2. Binding assays 

Also provided herein are methods for identification and isolation of 
agents, particularly compounds that bind to CVSP16s. The assays are 
designed to identify agents that bind to the zymogen form, the single chain 

20 isolated protease domain (or a protein, other than a CVSP16 polypeptide, 
that contains the protease domain of a CVSP16 polypeptide), and to the 
activated form, including the activated form derived from the full length 
zymogen or from a polypeptide that contains the protease domain. The 
identified compounds are candidates or leads for identification of 

25 compounds for treatments of tumors and other disorders and diseases 
involving aberrant proliferation and/or angiogenesis. The CVSP16 
polypeptides used in the methods include any CVSP16 polypeptide as 
defined herein, including the CVSP16 single chain protease domain or 
proteolytically active portion thereof. 

30 A variety of methods are provided herein. These methods can be 

performed in solution or in solid phase reactions in which the CVSP16 
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polypeptide(s) or protease domain(s) thereof are linked, either directly or 
indirectly via a linker, to a solid support. Screening assays are described 
in the Examples, and these assays can be used to identify candidate 
compounds. For purposes herein, all binding assays described above are 
5 provided for CVSP16. 

Methods for identifying an agent, such as a compound, that 
specifically binds to a CVSP16 single chain protease domain, a zymogen or 
full-length activated CVSP16 and/or two-chain protease domain thereof or 
other polypeptides provided herein are provided. The method can be 

10 practiced by (a) contacting the CVSP16 with one or a plurality of test 

agents under conditions conducive to binding between the CVSP16 and an 
agent; and (b) identifying one or more agents within the plurality that 
specifically binds to one or more CVSP16 forms. 

For example, in practicing such methods the CVSP1 6 polypeptide is 

15 mixed with a potential binding partner or an extract or fraction of a cell 
under conditions that allow the association of potential binding partners 
with the polypeptide. After mixing, peptides, polypeptides, proteins or 
other molecules that have become associated with a CVSP1 6 are 
separated from the mixture. The binding partner that bound to the 

20 CVSP16 can then be removed and further analyzed. To identify and isolate 
a binding partner, the entire protein, for instance the entire disclosed 
protein of SEQ ID Nos. 6 can be used. Alternatively, a fragment of the 
protein can be used. 

A variety of methods can be used to obtain cell extracts or body 

25 fluids, such as blood, serum, urine, sweat, synovial fluid, CSF and other 
such fluids. For example, cells can be disrupted using either physical or 
chemical disruption methods. Examples of physical disruption methods 
include, but are not limited to, sonication and mechanical shearing. 
Examples of chemical lysis methods include, but are not limited to, 

30 detergent lysis and enzyme lysis. A skilled artisan can readily adapt 
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methods for preparing cellular extracts in order to obtain extracts for use in 
the present methods. 

Once an extract of a cell is prepared, the extract is mixed with the 
CVSP1 6 under conditions in which association of the protein with the 
5 binding partner can occur. A variety of conditions can be used, including 
conditions that resemble conditions found in the cytoplasm of a human 
cell. Features such as osmolarity, pH, temperature, and the concentration 
of cellular extract used, can be varied to optimize the association of the 
protein with the binding partner. Similarly, methods for isolation of 

10 molecules of interest from body fluids are known. 

After mixing under appropriate conditions, the bound complex is 
separated from the mixture. A variety of techniques can be used to 
separate the mixture. For example, antibodies specific to a CVSP16 can 
be used to immunoprecipitate the binding partner complex. Alternatively, 

15 standard chemical separation techniques such as chromatography and 
density/sediment centrifugation can be used. 

After removing the non-associated cellular constituents in the 
extract, the binding partner can be dissociated from the complex using 
conventional methods. For example, dissociation can be accomplished by 

20 altering the salt concentration and/or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed 
extract, the CVSP16 can be immobilized on a solid support. For example, 
the protein can be attached to a nitrocellulose matrix or acrylic beads. 
Attachment of the protein or a fragment thereof to a solid support aids in 

25 separating peptide/binding partner pairs from other constituents found in 

the extract. The identified binding partners can be either a single protein or 
a complex made up of two or more proteins. 

Alternatively, the nucleic acid molecules encoding the single chain 
proteases can be used in a yeast two-hybrid system. The yeast two-hybrid 

30 system has been used to identify other protein partner pairs and can readily 
be adapted to employ the nucleic acid molecules herein described. 
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Another in vitro binding assay, particularly for a CVSP1 6, uses a 
mixture of a polypeptide that contains at least the catalytic domain of one 
of these proteins and one or more candidate binding targets or substrates. 
After incubating the mixture under appropriate conditions, the ability of the 
5 CVSP16 or a polypeptide fragment thereof containing the catalytic domain 
to bind to or interact with the candidate substrate is assessed. For cell- 
free binding assays, one of the components includes or is coupled to a 
detectable label. The label can provide for direct detection, such as 
radioactivity, luminescence, including fluorescence, optical or electron 

10 density, or indirect detection such as an epitope tag and an enzyme. A 
variety of methods can be employed to detect the label depending on the 
nature of the label and other assay components. For example, the label 
can be detected bound to the solid substrate or a portion of the bound 
complex containing the label can be separated from the solid substrate, 

15 and the label thereafter detected. 

3. Detection of signal transduction 

Secreted CVSPs, such as CVSP16, can be involved in signal trans- 
duction either directly by binding to or interacting with a cell surface 
receptor or indirectly by activating proteins, such as pro-growth factors 

20 that can initiate signal transduction. Assays for assessing signal 

transduction are well known to those of skill in the art, and can be adapted 
for use with the CVSP1 6 polypeptide. 

Assays for identifying agents that affect or alter signal transduction 
mediated directly or indirectly, such as via activation of a pro-growth 

25 factor, by a CVSP1 6, particularly the full length or a portion sufficient to 
exhibit proteolytic or binding activity. Such assays, include, for example, 
transcription based assays in which modulation of a transduced signal is 
assessed by detecting an effect on an expression from a reporter gene 
(see, e.g., U.S. Patent No. 5,436,128). 
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4. Methods for Identifying Agents that Modulate the Expression 
a Nucleic Acid Encoding a CVSP16 

Another embodiment provides methods for identifying agents that 

modulate the expression of a nucleic acid encoding a CVSP16. Such 

5 assays use any available means of monitoring for changes in the 

expression level of the nucleic acids encoding a CVSP16. 

In one assay format, cell lines that contain reporter gene fusions 

between the open reading frame of CVSP1 6 or a domain thereof, 

particularly the protease domain and any assayable fusion partner can be 

10 prepared. Numerous assayable fusion partners are known and readily 
available including the firefly luciferase gene and the gene encoding 
chloramphenicol acetyltransferase (Alam et al.. Anal. Biochem. 188: 245- 
54 (1990)). Cell lines containing the reporter gene fusions are then 
exposed to the agent to be tested under appropriate conditions and time. 

15 Differential expression of the reporter gene between samples exposed to 
the agent and control samples identifies agents which modulate the 
expression of a nucleic acid encoding a CVSP16. 

Additional assay formats can be used to monitor the ability of the 
agent to modulate the expression of a nucleic acid encoding a CVSP16. 

20 For instance, mRNA expression can be monitored directly by hybridization 
to the nucleic acids. Cell lines are exposed to the agent to be tested under 
appropriate conditions and time and total RNA or mRNA is isolated by 
standard procedures (see, e.g., Sambrook et al. (2001) Molecular Cloning: 
A Laboratory Manual (3rd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, 

25 Cold Spring Harbor, N.Y.). Probes to detect differences in RNA expression 
levels between cells exposed to the agent and control cells can be 
prepared from the nucleic acids. It is typical, but not necessary, to design 
probes which hybridize only with target nucleic acids under conditions of 
high stringency. Only highly complementary nucleic acid hybrids form 

30 under conditions of high stringency. Accordingly, the stringency of the 
assay conditions determines the amount of complementarity which should 
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exist between two nucleic acid strands in order to form a hybrid. 
Stringency should be chosen to maximize the difference in stability 
between the probertarget hybrid and potential probe:non-target hybrids. 
Probes can be designed from the nucleic acids through methods 
5 known in the art. For instance, the G + C content of the probe and the 
probe length can affect probe binding to its target sequence. Methods to 
optimize probe specificity are commonly available (see, e.g., Sambrook et 
a/. (2001) Molecular Cloning: A Laboratory Manual (3rd Ed.), Vol. 1-3, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; Sambrook eta/. 

10 (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. Cold 
Spring Harbor Laboratory Press); and Ausubel eta/. (1995) CURRENT 
PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Co., NY). 

Hybridization conditions are modified using known methods ((see, 
e.g., Sambrook et a/. (2001) Molecular Cloning: A Laboratory Manual (3rd 

15 Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; 
Sambrook eta/. (1989) MOLECULAR CLONING: A LABORATORY 
MANUAL, 2nd Ed. Cold Spring Harbor Laboratory Press); and Ausubel et 
a/. (1995) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene 
Publishing Co., NY), as required for each probe. Hybridization of total 

20 cellular RNA or RNA enriched for polyA RNA can be accomplished in any 
available format. For instance, total cellular RNA or RNA enriched for 
polyA RNA can be affixed to a solid support, and the solid support exposed 
to at least one probe comprising at least one, or part of one of the nucleic 
acid molecules under conditions in which the probe specifically hybridizes. 

25 Alternatively, nucleic acid fragments comprising at least one, or part of one 
of the sequences can be affixed to a solid support, such as a porous glass 
wafer. The glass wafer can then be exposed to total cellular RNA or polyA 
RNA from a sample under conditions in which the affixed sequences 
specifically hybridize. Such glass wafers and hybridization methods are 

30 widely available, for example, those disclosed by Beattie (WO 95/1 1755). 
By examining for the ability of a given probe to specifically hybridize to an 
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RNA sample from an untreated cell population and from a cell population 
exposed to the agent, agents which up or down regulate the expression of 
a nucleic acid encoding the CVSP16 polypeptide, are identified. 

In one format, the relative amounts of a protein between a cell 
5 population that has been exposed to the agent to be tested compared to an 
un-exposed control cell population can be assayed {e.g., a prostate cancer 
cell line, a lung cancer cell line, a colon cancer cell line or a breast cancer 
cell line). In this format, probes, such as specific antibodies, are used to 
monitor the differential expression or level of activity of the protein in the 

10 different cell populations or body fluids. Cell lines or populations or body 
fluids are exposed to the agent to be tested under appropriate conditions 
and time. Cellular lysates or body fluids can be prepared from the exposed 
cell line or population and from a control, unexposed cell line or population 
or unexposed body fluid. The cellular lysates or body fluids are then 

15 analyzed with the probe. 

For example, N- and C- terminal fragments of the CVSP16 can be 
expressed in bacteria and used to search for proteins which bind to these 
fragments. Fusion proteins, such as His-tag or GST fusion to the N- or C- 
terminal regions of the CVSP1 6 can be prepared for use as a binding 

20 partner. These fusion proteins can be coupled to, for example, 

Glutathione-Sepharose beads and then probed with cell lysates or body 
fluids. Prior to lysis, the cells or body fluids can be treated with a 
candidate agent which can modulate a CVSP16 or proteins that interact 
with domains thereon. Lysate proteins binding to the fusion proteins can 

25 be resolved by SDS-PAGE, isolated and identified by protein sequencing or 
mass spectroscopy, as is known in the art. 

Antibody probes are prepared by immunizing suitable mammalian 
hosts in appropriate immunization protocols using the peptides, 
polypeptides or proteins if they are of sufficient length (e.g., 4, 5, 6, 7, 8, 

30 9, 10, 1 1, 12, 13, 14, 15, 20, 25, 30, 35, 40 or more consecutive amino 
acids of the CVSP16 polypeptide or if required to enhance immunogenicity, 
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conjugated to suitable carriers. Methods for preparing immunogenic 
conjugates with carriers, such as bovine serum albumin (BSA), keyhole 
limpet hemocyanin (KLH), or other carrier proteins are well known in the 
art. In some circumstances, direct conjugation using, for example, 
5 carbodiimide reagents can be effective; in other instances linking reagents 
such as those supplied by Pierce Chemical Co., Rockford, IL, can be 
desirable to provide accessibility to the hapten. Hapten peptides can be 
extended at either the amino or carboxy terminus with a Cys residue or 
interspersed with cysteine residues, for example, to facilitate linking to a 

10 carrier. Administration of the immunogens is conducted generally by 

injection over a suitable time period and with use of suitable adjuvants, as 
is generally understood in the art. During the immunization schedule, titers 
of antibodies are taken to determine adequacy of antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides 

15 corresponding to, for example, the carboxy terminal amino acids of the 
CVSP16. Synthetic peptides can be as small as 1-3 amino acids in length, 
generally at least 4 or more amino acid residues long. The peptides can be 
coupled to KLH using standard methods and can be immunized into 
animals, such as rabbits or ungulate. Polyclonal antibodies then can then 

20 be purified, for example using Actigel beads containing the covalently 
bound peptide, or other reagents for affinity purification methods or by 
purifying all of the IgGs, using Protein A or Protein G columns or other 
such methods. 

While the polyclonal antisera produced in this way can be 
25 satisfactory for some applications. For pharmaceutical compositions 

monoclonal preparations are generally used. Immortalized cell lines which 
secrete the desired monoclonal antibodies can be prepared using the 
standard method of Kohler et al. f {Nature 256: 495-7 (1975)) or 
modifications which effect immortalization of lymphocytes or spleen cells, 
30 as is generally known. The immortalized cell lines secreting the desired 
antibodies are screened by immunoassay in which the antigen is the 
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peptide hapten, polypeptide or protein. When the appropriate immortalized 
cell culture secreting the desired antibody is identified, the cells can be 
cultured either in vitro or by production in vivo via ascites fluid. Of 
particular interest, are monoclonal antibodies that recognize the catalytic 
5 domain of a CVSP16. 

Additionally, the zymogen, a single chain or a three-chain or a two- 
chain form of the CVSP16 can be used to make monoclonal antibodies that 
recognize conformational epitopes. The desired monoclonal antibodies are 
then recovered from the culture supernatant or from the ascites 

10 supernatant. Fragments of the monoclonals or the polyclonal antisera that 
contain the antigen binding portion can be used as antagonists, as well as 
the intact antibodies. Immunologically reactive fragments, such as the 
Fab, Fab', or F(ab') 2 fragments are often used, especially in a therapeutic 
context, as these fragments are generally less immunogenic than the whole 

15 immunoglobulin. Regions that bind specifically to the desired regions of 

receptor also can be produced in the context of chimeras with multiple 

species origin. Fully human antibodies also can be prepared using, for 

example, either transgenic mice that contain human immunoglobulin genes 

or phage display libraries of human antibodies. 

20 G. Assay formats and selection of test substances that modulate at 
least one activity of a CVSP16 polypeptide 

Methods for identifying agents that modulate at least one activity of 
a CVSP16 are provided. Methods include phage display and include other 
methods for assessing alterations in the activity of an CVSP16. Such 
25 methods or assays can use any means of monitoring or detecting the 

desired activity. A variety of formats and detection protocols are known 
for performing screening assays. Any such formats and protocols can be 
adapted for identifying modulators of CVSP16 polypeptide activities. The 
following includes a discussion of exemplary protocols. 
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1 . High throughput screening assays 

Although the above-described assay can be conducted where a 
single CVSP16 polypeptide is screened, and/or a single test substance is 
screened in one assay, the assay typically is conducted in a high 
5 throughput screening mode, i.e., a plurality of the SP proteins are screened 
against and/or a plurality of the test substances are screened 
simultaneously {See generally. High Throughput Screening: The Discovery 
of Bioactive Substances (Devlin, Ed.) Marcel Dekker, 1997; Sittampalam et 
at., Curr. Opin. Chem. Biol., 7:384-91 (1997); and Silverman et al., Cum 

10 Opin. Chem. Biol., 2:397-403 (1998)). For example, the assay can be 
conducted in a multi-well (e.g., 24-, 48-, 96-, 384-, 1536-well or higher 
density), chip or array format. 

High-throughput screening (HTS) is the process of testing a large 
number of diverse chemical structures against disease targets to identify 

15 "hits" (Sittampalam et al., Curr. Opin. Chem. Biol., 7:384-91 (1997)). 
Current state-of-the-art HTS operations are highly automated and 
computerized to handle sample preparation, assay procedures and the 
subsequent processing of large volumes of data. 

Detection technologies employed in high-throughput screens depend 

20 on the type of biochemical pathway being investigated (Sittampalam et al., 
Curr. Opin. Chem. Biol., 7:384-91 (1997)). These methods include, 
radiochemical methods, such as the scintillation proximity assays (SPA), 
which can be adapted to a variety of enzyme assays (Lerner et al., J. 
Biomol. Screening, 7:135-143 (1996); Baker et al. , Anal. Biochem., 

25 235:20-24 (1996); Baum eta/., Anal. Biochem., 237:129-134 (1996); and 
Sullivan et al., J. Biomol. Screening 2:19-23 (1997)) and protein-protein 
interaction assays (Braunwalder et al., J. Biomol. Screening 7:23-26 
(1996); Sonatore et al., Anal. Biochem. 240:289-297 (1996); and Chen et 
al., J. Biol. Chem. 277:25308-25315 (1996)), and non-isotopic detection 

30 methods, including but are not limited to, colorimetric and luminescence 
detection methods, resonance energy transfer (RET) methods, time- 
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resolved fluorescence (HTRF) methods, cell-based fluorescence assays, 
such as fluorescence resonance energy transfer (FRET) procedures (see, 
e. g. t Gonzalez et aL, Biophys. J., 65:1272-1280 (1995)), fluorescence 
polarization or anisotropy methods (see, e.g., Jameson et aL, Methods 
5 Enzymol. 246:283-300 (1995); Jolley, J. Biomol. Screening 7:33-38 
(1996); Lynch et aL, AnaL Biochem. 247:77-82 (1997)), fluorescence 
correlation spectroscopy (FCS) and other such methods. 
2. Test Substances 
Test compounds, including small molecules, antibodies, proteins, 

10 nucleic acids, peptides, natural products, mixtures of natural products, 

derivatives {e.g, chemical derivatives) of natural products, and libraries and 
collections thereof, can be screened in the above-described assays and 
assays described below to identify compounds that modulate the activity 
of a CVSP16 polypeptide. Rational drug design methodologies that rely on 

15 computational chemistry can be used to screen and identify candidate 
compounds. 

Test compounds (agents) that are assayed in the methods can be 
produced and obtained by any method known to those of skill in the art. 
For example, they can be randomly selected or rationally selected or 

20 designed. The agents can be, as examples, peptides, small molecules, 
and carbohydrates. A skilled artisan can readily recognize that there is no 
limit to the structural nature of the agents. The peptide agents can be 
prepared using standard solid phase (or solution phase) peptide synthesis 
methods, as is known in the art. In addition, the DNA encoding these 

25 peptides can be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard 
recombinant production systems. The production using solid phase peptide 
synthesis is necessitated if non-gene-encoded amino acids are to be 
included. 
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The compounds identified by the screening methods include 
inhibitors, including antagonists, and agonists. Compounds for screening 
include any compounds and collections of compounds available, known or 
that can be prepared. 
5 a. Selection of Compounds 

Compounds can be selected for their potency and selectivity of 
inhibition of serine proteases, especially a CVSP16 polypeptide. As 
described herein, and as generally known, a target serine protease and its 
substrate are combined under assay conditions permitting reaction of the 

10 protease with its substrate. The assay is performed in the absence of test 
compound, and in the presence of increasing concentrations of the test 
compound. The concentration of test compound at which 50% of the 
serine protease activity is inhibited by the test compound is the IC 50 value 
(Inhibitory Concentration) or EC 50 (Effective Concentration) value for that 

15 compound. Within a series or group of test compounds, those having 
lower IC 50 or EC 50 values are considered more potent inhibitors of the 
serine protease than those compounds having higher IC 50 or EC 50 values. 
The IC 50 measurement is often used for more simplistic assays, whereas 
the EC 50 is often used for more complicated assays, such as those 

20 employing cells. 

Typically candidate compounds have an IC 50 value of 100 nM or less 
as measured in an in vitro assay for inhibition of CVSP1 6 polypeptide 
activity. The test compounds also are evaluated for selectivity toward a 
serine protease. As described herein, and as generally known, a test 

25 compound is assayed for its potency toward a panel of serine proteases 
and other enzymes and an IC 50 value or EC 50 value is determined for each 
test compound in each assay system. A compound that demonstrates a 
low IC 50 value or EC 50 value for the target enzyme, e.g., CVSP1 6 
polypeptide, and a higher IC 50 value or EC 50 value for other enzymes within 

30 the test panel (e.g., urokinase tissue plasminogen activator, thrombin, 
Factor Xa), is considered to be selective toward the target enzyme. 
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Generally, a compound is deemed selective if its IC 50 value or EC 50 value in 
the target enzyme assay is at least 2-fold, 5-fold, 10-fold (or higher-fold) 
less than the next smallest IC 50 value or EC 50 value measured in the 
selectivity panel of enzymes. 
5 Compounds are also evaluated for their activity in vivo. The type of 

assay chosen for evaluation of test compounds depends on the 
pathological condition to be treated or prevented by use of the compound, 
as well as the route of administration to be evaluated for the test 
compound. 

10 For instance, to evaluate the activity of a compound to reduce tumor 

growth through inhibition of CVSP16 polypeptide, the procedures 
described by Jankun et al., Cane. Res. 57:559-563 (1997). Briefly, the 
ATCC cell lines DU145 and LnCaP are injected into SCID mice. After 
tumors are established, the mice are given test compound according to a 

15 dosing regime determined from the compound's in vitro characteristics. 
The Jankun et al. compound was administered in water. Tumor volume 
measurements are taken twice a week for about five weeks. A compound 
is deemed active if an animal to which the compound was administered 
exhibited decreased tumor volume, as compared to animals receiving 

20 appropriate control compounds. 

Another in vivo experimental model designed to evaluate the effect 
of p-aminobenzamidine, a swine protease inhibitor, on reducing tumor 
volume is described by Billstrom et al., Int. J. Cancer 57:542-547 (1995). 
To evaluate the ability of a compound to reduce the occurrence of, 

25 or inhibit, metastasis, the procedures described by Kobayashi et al. Int. J. 
Cane. 57:727-733d (1994) can be employed. Briefly, a murine xenograft 
selected for high lung colonization potential is injected into C57B1/6 mice 
i.v. (experimental metastasis) or s.c. into the abdominal wall (spontaneous 
metastasis). Various concentrations of the compound to be tested can be 

30 admixed with the tumor cells in Matrigel prior to subcutaneous injection. 
Daily i.p. injections of the test compound are made either on days 1-6 or 
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days 7-13 after tumor inoculation. The animals are sacrificed about three 
or four weeks after tumor inoculation, and the lung tumor colonies are 
counted. Evaluation of the resulting data permits a determination as to 
efficacy of the test compound, optimal dosing and route of administration. 
5 The activity of the tested compounds toward decreasing tumor 

volume and metastasis can be evaluated in a model, such as that described 
in Rabbani et aL, Int. J. Cancer 63:840-845 (1995) to evaluate its 
inhibitor. There, Mat LyLu tumor cells were injected into the flank of 
Copenhagen rats. The animals were implanted with osmotic minipumps to 

10 continuously administer various doses of test compound for up to three 
weeks. The tumor mass and volume of experimental and control animals 
were evaluated during the experiment, as were metastatic growths. 
Evaluation of the resulting data permits a determination as to efficacy of 
the test compound, optimal dosing, and route of administration. Some of 

15 these authors described a related protocol in Xing et aL, Cane. Res. 
57:3585-3593 (1997). 

To evaluate the anti-angiogenesis activity of a compound, a rabbit 
cornea neovascularization model can be employed (see, e.g., Avery eta/. 
(1990) Arch. Ophthalmol. 708: 1474-1 47). Avery et al. describes 

20 anesthetizing New Zealand albino rabbits and then making a central corneal 
incision and forming a radial corneal pocket. A slow release prostaglandin 
pellet was placed in the pocket to induce neovascularization. Test 
compound was administered i.p. for five days, at which time the animals 
were sacrificed. The effect of the test compound is evaluated by review of 

25 periodic photographs taken of the limbus, which can be used to calculate 
the area of neovascular response and, therefore, limbal neovascularization. 
A decreased area of neovascularization as compared with appropriate 
controls indicates the test compound was effective at decreasing or 
inhibiting neovascularization. 

30 An angiogenesis model used to evaluate the effect of a test 

compound in preventing angiogenesis is described by Min et al. Cane. Res. 
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56:2428-2433 (1996). C57BL6 mice receive subcutaneous injections of a 
Matrigel mixture containing bFGF, as the angiogenesis-inducing agent, with 
and without the test compound. After five days, the animals are sacrificed 
and the Matrigel plugs, in which neovascularization can be visualized, are 
5 photographed. An experimental animal receiving Matrigel and an effective 
dose of test compound exhibits less vascularization than a control animal 
or an experimental animal receiving a less- or non-effective dose of 
compound. 

An in vivo system designed to test compounds for their ability to 

10 limit the spread of primary tumors is described by Crowley et al., Proc. 

Natl. Acad. Sci. 50:5021-5025 (1993). Nude mice are injected with tumor 
cells (PC3) engineered to express CAT (chloramphenicol acetyltransf erase). 
Compounds to be tested for their ability to decrease tumor size and/or 
metastases are administered to the animals, and subsequent measurements 

15 of tumor size and/or metastatic growths are made. In addition, the level of 
CAT detected in various organs provides an indication of the ability of the 
test compound to inhibit metastasis; detection of less CAT in tissues of a 
treated animal versus a control animal indicates less CAT-expressing cells 
migrated to that tissue. 

20 In vivo experimental models designed to evaluate the inhibitory 

potential of a test serine protease inhibitors, using a tumor cell line F3II 
known to be highly invasive (see, e.g., Alonso et a/., Breast Cane. Res. 
Treat. 40:209-223 (1996)) can be used. Alonso describes in vivo studies 
for toxicity determination, tumor growth, invasiveness, spontaneous 

25 metastasis, experimental lung metastasis, and an angiogenesis assay. 

The CAM model (chick embryo chorioallantoic membrane model), 
first described by L. Ossowski in 1998 {J. Cell Biol. 707:2437-2445 
(1988)), provides another method for evaluating the inhibitory activity of a 
test compound. In the CAM model, tumor cells invade through the 

30 chorioallantoic membrane. Administration of several serine protease 
inhibitors resulted in less or no invasion of the tumor cells through the 
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membrane. Thus, the CAM assay is performed with CAM and tumor cells 
in the presence and absence of various concentrations of test compound. 
The invasiveness of tumor cells is measured under such conditions to 
provide an indication of the compound's inhibitory activity. A compound 
5 having inhibitory activity correlates with less tumor invasion. 

The CAM model also is used in a standard assay of angiogenesis 
(i.e., effect on formation of new blood vessels (see, e.g., Brooks et al. 
Methods in Molecular Biology 723:257-269 (1999)). In this model, a filter 
disc containing an angiogenesis inducer, such as basic fibroblast growth 

10 factor (bFGF) is placed onto the CAM. Diffusion of the cytokine into the 
CAM induces local angiogenesis, which can be measured in several ways 
such as by counting the number of blood vessel branch points within the 
CAM directly below the filter disc. The ability of identified compounds to 
inhibit cytokine-induced angiogenesis can be tested using this model. A 

15 test compound can either be added to the filter disc that contains the 
angiogenesis inducer, be placed directly on the membrane or be 
administered systemically. The extent of new blood vessel formation in 
the presence and/or absence of test compound can be compared using this 
model. The formation of fewer new blood vessels in the presence of a test 

20 compound would be indicative of anti-angiogenesis activity. A 

demonstration of anti-angiogenesis activity for inhibitors of a CVSP16 
polypeptide is indicative of a role in angiogenesis for that SP protein. 

b. Known serine protease inhibitors 
Compounds for screening can be serine protease inhibitors, which 

25 can be tested for their ability to inhibit the activity of a CVSP16. 

Exemplary serine protease inhibitors for use in the screening assays 
include, but are not limited to: Serine Protease Inhibitor 3 (SPI-3) (Chen, et 
al. Cytokine, //:856-862 (1999)); Aprotinin (lijima, R., eta/., J. Biochem. 
(Tokyo) 725:912-916 (1999)); Kazal-type serine protease inhibitor-like 

30 proteins (Niimi, et al. Eur. J. Biochem., 266:282-292 (1999)); Kunitz-type 
serine protease inhibitor (Ravichandran, S., eta/., Acta Crystallogr. D. Biol. 
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Crystallogr., 55:1814-1821 (1999)); Tissue factor pathway inhibitor- 
2/Matrix-associated serine protease inhibitor (TFPI-2/MSPI), (Liu, Y. eta/. 
Arch. Biochem. Biophys. 370:112-8 (1999)); Bukunin (Cui, C.Y. et al. J. 
Invest. Dermatol. 1 13: 1 82-8 (1 999)); Nafmostat mesilate (Ryo, R. et al. 
5 Vox Sang. 76:241-6 (1999)); TPCK (Huang et al. Oncogene 73:3431-3439 
(1999)); A synthetic cotton-bound serine protease inhibitor (Edwards et al. 
Wound Repair Regen. 7:106-18 (1999)); FUT-175 (Sawada, M. et al. 
Stroke 30:644-50 (1999)); Combination of serine protease inhibitor FUT- 
0175 and thromboxane synthetase inhibitor OKY-046 (Kaminogo et al. 

10 Neurol. Med. Chir. (Tokyo) 33:704-8; discussion 708-9 (1998)); The rat 
serine protease inhibitor 2.1 gene (LeCam, A., et al., Biochem. Biophys. 
Res. Commun., 253:31 1-4 (1998)); A new intracellular serine protease 
inhibitor expressed in the rat pituitary gland complexes with granzyme B 
(Hill et al. FEBS Lett. 440:361-4 (1998)); 3,4-Dichloroisocoumarin 

15 (Hammed et al. Proc. Soc. Exp. Biol. Med., 2 73:132-7 (1998)); LEX032 

(Bains et al. Eur. J. Pharmacol. 356:67-72 (1998)); N-tosyl-L-phenylalanine 
chloromethyl ketone (Dryjanski et al. Biochemistry 37:14151-6 (1998)); 
Mouse gene for the serine protease inhibitor neuroserpin (P1 12) (Berger et 
al. Gene, 2 74:25-33 (1998)); Rat serine protease inhibitor 2.3 gene (Paul 

20 et al. Eur. J. Biochem. 254:538-46 (1998)); Ecotin (Yang et al. J. Mol. 
Biol. 273:945-57 (1998)); A 14 kDa plant-related serine protease inhibitor 
(Roch et al. Dev. Comp. Immunol. 22(1): 1-1 2 (1998)); Matrix-associated 
serine protease inhibitor TFPI-2/33 kDa MSPI (Rao et al. Int. J. Cancer 
76:749-56 (1998)); ONO-3403 (Hiwasa et al. Cancer Lett. 726:221-5 

25 (1998)); Bdellastasin (Moser et al. Eur. J. Biochem. 253:212-20 (1998)); 
Bikunin (Xu et al. J. Mol. Biol. 276:955-66 (1998)); Nafamostat mesilate 
(Mellgren et al. Thromb. Haemost. 73:342-7 (1998)); The growth hormone 
dependent serine protease inhibitor, Spi 2.1 (Maake et al. Endocrinology 
733:5630-6 (1997)); Growth factor activator inhibitor type 2, a Kunitz-type 

30 serine protease inhibitor (Kawaguchi et al. J. Biol. Chem., 272:27558-64 
(1997)); Heat-stable serine protease inhibitor protein from ovaries of the 
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desert locust, Schistocerga gregaria (Hamdaoui et al. Biochem. Biophys. 
Res. Commun. 235:357-60 (1997)); Human placental Hepatocyte growth 
factor activator inhibitor, a Kunitz-type serine protease inhibitor 
(Shimomura et al. J. Biol. Chem. 272:6370-6 (1997)); FUT-187, oral serine 
5 protease inhibitor (Shiozaki et al. Gan To Kaguku Ryoho, 23(14): 1971-9 
(1996)); Extracellular matrix-associated serine protease inhibitors (Mr 
33,000, 31,000, and 27,000 (Rao, C.N., et a/., Arch. Biochem. Biophys., 
335:82-92 (1996)); An irreversible isocoumarin serine protease inhibitor 
(Palencia, D.D., eta/., Biol. Reprod., 55:536-42 (1996)); 4-(2-aminoethyl)- 

10 benzenesulfonyl fluoride (AEBSF) (Nakabo et al. J. Leukoc. Biol. 60:328-36 
(1996)); Neuroserpin (Osterwalder, T., etal. f EMBO J. 75:2944-53 
(1996)); Human serine protease inhibitor alpha-1 -antitrypsin (Forney et al. 
J. Parasitol.. 82:496-502 (1996)); Rat serine protease inhibitor 2.3 (Simar- 
Blanchet, A.E., et al. t Eur. J. Biochem., 236:638-48 (1996)); Gebaxate 

15 mesilate (parodi, F., et al., J. Cardiothorac. Vase. Anesth. 70:235-7 

(1996)); Recombinant serine protease inhibitor, CPTI II (Stankiewicz, M., et 
al., (Acta Biochim. Pol. 43(3):525-9 (1996)); A cysteine-rich serine 
protease inhibitor (Guamerin II) (Kim, D.R., eta/., J. Enzym. Inhib., 70:81- 
91 (1996)); diisopropylfluorophosphate (Lundqvist, H., et al., Inflamm. Res. 

20 44(12):510-7 (1995)); Nexin 1 (Yu, D.W., etal. f J. Cell Sci. 108(Pt 
72,1:3867-74 (1995)); LEX032 (Scalia, R., eta/., Shock 4^:251-6 
(1995)); Protease nexin I (Houenou, L.J., eta/., Proc. Natl. Acad. Sci. 
U.S.A. 92(3):895-9 (1995)); Chymase-directed serine protease inhibitor 
(Woodard S.L., eta/., J. Immunol. 153(1 7>/:501 6-25 (1994)); N-alpha- 

25 tosyl-L-lysyl-chloromethyl ketone (TLCK) (Bourinbaiar, A.S., eta/., Cell 
Immunol. 155(1 7:230-6 (1994)); Smpi56 (Ghendler, Y., et al., Exp. 
Parasitol. 73^:121-31 (1994)); Schistosoma haematobium serine protease 
inhibitor (Blanton, R.E., eta/., Mo/. Biochem. Parasitol. 63(1)\\A^ (1994)); 
Spi-1 (Warren, W.C., eta/., Mo/. Cell Endocrinol. 98(1):27-32 (1993)); 

30 TAME (Jessop, J. J., eta/., Inflammation 77^:613-31 (1993)); 

Antithrombin III (Kalaria, R.N., et al., Am. J. Pathol. 743^:886-93 
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(1993)); FOY-305 (Ohkoshi, M., et al., Anticancer Res. 13(4):963-6 
(1993)); Camostat mesilate (Senda, S., eta/., Intern. Med. 32(4):3S0-4 
(1993)); Pigment epithelium-derived factor (Steele, F.R., et al., Proc. Natl. 
Acad. Sci. U.S.A. 90(4): 1 526-30 (1 993)); Antistasin (Holstein, T.W., et 
5 al., FEBS Lett. 309(3): 288-92 (1 992)); the vaccinia virus K2L gene 

encodes a serine protease inhibitor (Zhou, J., et al., Virology 189(2):61Z- 
86 (1992)); Bowman-Birk serine-protease inhibitor (Werner, M.H., et al., J. 
Mol. Biol. 225(3): 873-89 (1992); FUT-175 (Yanamoto, H., eta/., 
Neurosurgery 30^:358-63 (1992)); FUT-175; (Yanamoto, H., eta/., 

10 Neurosurgery 30(3) :35 1-6, discussion 356-7 (1992)); PAI-I (Yreadwell, 

B.V., eta/., J. Orthop. Res. 9^:309-16 (1991)); 3,4-Dichloroisocoumarin 
(Rusbridge, N.M., eta/., FEBS Lett. 268(1)^33-6 (1990)); Alpha 1- 
antichymotrypsin (Lindmark, B.E., eta/., Am. Rev. Respir. Des. 141(4 Pt 
1):884-8 (1990)); P-toluenesulfonyl-L-arginine methyl ester (TAME) 

15 (Scuderi, P., J. Immunol., 143(1)^68-73 (1989)); Alpha 1- 

antichymotrypsin (Abraham, C.R., et al. Cell 52^:487-501 (1988)); 
Contrapsin (Modha, J., eta/., Parasitology 96 (Pt 7,/:99-109 (1988)); 
Alpha 2-antiplasmin (Holmes, W.E., et al., J. Biol. Chem. 262(4) A 659-64 
(1987)); 3,4-dichloroisocoumarin (Harper, J.W., eta/., Biochemistry 

20 24fS;:1831-41 (1985)); Diisopropylfluorophosphate (Tsutsui, K., eta/., 
Biochem. Biophys. Res. Commun. 123(1):21\-1 (1984)); Gabexate 
mesilate (Hesse, B., eta/., Pharmacol. Res. Commun. 7677^:637-45 
(1984)); Phenyl methyl sulfonyl fluoride (Dufer, J., et al., Scand. J. 
Haematol. 32(1):25-32 (1984)); Protease inhibitor CI-2 (McPhalen, C.A., et 

25 al., J. Mol. Biol. 168(2)AA5-1 (1983)); Phenylmethylsulfonyl fluoride 
(Sekar etal., Biochem. Biophys. Res. Commun., 89(2):474-8 (1979)); 
PGE1 (Feinstein eta/., Prostaglandins 74^:1075-93 (1977)). 
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c. Combinatorial libraries and other libraries 

The source of compounds for the screening assays, can be libraries, 
including, but are hot limited to, combinatorial libraries. Methods for 
synthesizing combinatorial libraries and characteristics of such 
5 combinatorial libraries are known in the art (See generally, Combinatorial 
Libraries: Synthesis, Screening and Application Potential (Cortese Ed.) 
Walter de Gruyter, Inc., 1995; Tietze and Lieb, Curr. Opin. Chem. Biol. 
2^:363-71 (1 998); Lam, Anticancer Drug Des. 72/^:145-67(1997); 
Blaney and Martin, Curr. Opin. Chem. Biol. 7(1):54-9 (1997); and Schultz 

10 and Schultz, Biotechnol. Prog. 72(61:729-43 (1996)). 

Methods and strategies for generating diverse libraries, primarily 
peptide- and nucleotide-based oligomer libraries, have been developed 
using molecular biology methods and/or simultaneous chemical synthesis 
methodologies (see, e.g. , Dower et a/., Annu. Rep. Med. Chem. 25:271- 

15 280 (1991); Fodor et a/., Science 257:767-773 (1991); Jung et al., 

Angew. Chem. Ind. Ed. Engl. 37:367-383 (1992); Zuckerman et al., Proc. 
Natl. Acad. Sci. USA 35:4505-4509 (1992); Scott etal. t Science 
245:386-390 (1990); Devlin et al., Science 245:404-406 (1990); Cwirla et 
al., Proc. Natl. Acad. Sci. USA 57:6378-6382 (1990); and Gallop et al., J. 

20 Medicinal Chemistry 37:1233-1251 (1994)). The resulting combinatorial 
libraries potentially contain millions of compounds that can be screened to 
identify compounds that exhibit a selected activity. 

The libraries fall into roughly three categories: fusion-protein- 
displayed peptide libraries in which random peptides or proteins are 

25 presented on the surface of phage particles or proteins expressed from 
plasmids; support-bound synthetic chemical libraries in which individual 
compounds or mixtures of compounds are presented on insoluble matrices, 
such as resin beads (see, e.g., Lam et al., Nature 354:82-84 (1991)) and 
cotton supports (see, e.g., Eichler et al., Biochemistry 32\\ 1035-1 1041 

30 (1993)); and methods in which the compounds are used in solution (see, 
e.g., Houghten et al., Nature 354:84-86 (1991); Houghten et al., 
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BioTechniques 3/3:412-421 (1992); and Scott eta/., Curr. Opin. 
Biotechnol. 5:40-48 (1994)). There are numerous examples of synthetic 
peptide and oligonucleotide combinatorial libraries and there are many 
methods for producing libraries that contain non-peptidic small organic 
5 molecules. Such libraries can be based on a basis set of monomers that 
are combined to form mixtures of diverse organic molecules or that can be 
combined to form a library based upon a selected pharmacophore 
monomer. 

Either a random or a deterministic combinatorial library can be 

10 screened by the presently disclosed and/or claimed screening methods. In 
either of these two libraries, each unit of the library is isolated and/or 
immobilized on a solid support. In the deterministic library, one knows a 
priori a particular unit's location on each solid support. In a random library, 
the location of a particular unit is not known a priori although each site still 

15 contains a single unique unit. Many methods for preparing libraries are 
known to those of skill in this art (see, e.g., Geysen et at., Proc. Natl. 
Acad. Sci. USA 37:3998-4002 (1984), Houghten eta/., Proc. Natl. Acad. 
Sci. USA 37:5131-5135 (1985)). 

Combinatorial libraries generated by any techniques known to those 

20 of skill in the art are contemplated (see, e.g., Table 1 of Schultz and 

Schultz, Biotechnol. Prog. 72^:729-43 (1996)) for screening; Bartel eta/., 
Science 267:1411-1418 (1993); Baumbach eta/. BioPharm (May):24-35 
(1992); Bock eta/. Nature 355:564-566 (1992); Borman, S., 
Combinatorial chemists focus on small molecules molecular recognition, 

25 and automation, Chem. Eng. News 2(72):29 (1996); Boublik, et a/., 

Eukaryotic Virus Display: Engineering the Major Surface Glycoproteins of 
the Autographa Californica Nuclear Polyhedrosis Virus (ACNPV) for the 
Presentation of Foreign Proteins on the Virus Surface, Bio/Technology 
73:1079-1084 (1995); Brenner, et a/., Encoded Combinatorial Chemistry, 

30 Proc. Natl. Acad Sci. U.S.A. 55:5381-5383 (1992); Caflisch, eta/., 
Computational Combinatorial Chemistry for De Novo Ligand Design: 
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Review and Assessment, Perspect. Drug Discovery Des. 3:51-84 (1995); 
Cheng, eta/., Sequence-Selective Peptide Binding with a Peptido-A,B-fraA7s- 
steroidal Receptor Selected from an Encoded Combinatorial Library, J. Am. 
Chem. Soc. 7 75:1813-1814 (1996); Chu, et al., Affinity Capillary 
5 Electrophoresis to Identify the Peptide in A Peptide Library that Binds Most 
Tightly to Vancomycin, J. Org. Chem. 55:648-652 (1993); Clackson, et 
al., Making Antibody Fragments Using Phage Display Libraries, Nature 
352:624-628 (1991); Combs, et al., Protein Structure-Based Combinatorial 
Chemistry: Discovery of Non-Peptide Binding Elements to Src SH3 Domain, 

10 J. Am. Chem. Soc. 7 75:287-288 (1996); Cwirla, et al., Peptides On 
Phage: A Vast Library of Peptides for Identifying Ligands, Proc. Natl. 
Acad. Set. U.S.A. 57:6378-6382 (1990); Ecker, eta/., Combinatorial Drug 
Discovery: Which Method will Produce the Greatest Value, Bio/Technology 
75:351-360 (1995); Ellington, eta/., In Vitro Selection of RNA Molecules 

15 That Bind Specific Ligands, Nature, 545:818-822 (1990); Ellman, J. A., 
Variants of Benzodiazepines, J. Am. Chem. Soc, 7 74:10997 (1992); 
Erickson, eta/., The Proteins; Neurath, H., Hill, R.L., Eds.: Academic: New 
York, 1976; pp. 255-257; Felici, eta/., J. Mo/. Biol. 222:301-310 (1991); 
Fodor, et al., Light-Directed, Spatially Addressable Parallel Chemical 

20 Synthesis, Science 251:767-773 (1991); Francisco, etaL, Transport and 
Anchoring of Beta-Lactamase to the External Surface of E. Coli., Proc. Natl. 
Acad. Sci. U.S.A. 55:2713-2717 (1992); Georgiou, et al., Practical 
Applications of Engineering Gram-Negative Bacterial Cell Surfaces, 
TIBTECH 7 7:6-10 (1993); Geysen, eta/., Use of peptide synthesis to probe 

25 viral antigens for epitopes to a resolution of a single amino acid, Proc. Natl. 
Acad. Sci. U.S.A. 57:3998-4002 (1984); Glaser, eta/., Antibody 
Engineering by Condon-Based Mutagenesis in a Filamentous Phage Vector 
System, J. Immunol. 743:3903-3913 (1992); Gram, eta/., In vitro 
selection and affinity maturation of antibodies from a naive combinatorial 

30 immunoglobulin library, Proc. Natl. Acad. Sci. 53:3576-3580 (1992); Han, 
et al., Liquid-Phase Combinatorial Synthesis, Proc. Natl. Acad. Sci. U.S.A. 
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92:6419-6423 (1995); Hoogenboom, eta/., Multi-Subunit Proteins on the 
Surface of Filamentous Phage: Methodologies for Displaying Antibody (Fab) 
Heavy and Light Chains, Nucleic Acids Res. 73:4133-4137 (1991); 
Houghten, e t a/. , General Method for the Rapid Solid-Phase Synthesis of 
5 Large Numbers of Peptides: Specificity of Antigen-Antibody Interaction at 
the Level of Individual Amino Acids, Proc. Natl. Acad. Sci. U.S.A. 
52:5131-5135 (1985); Houghten, eta/. f The Use of Synthetic Peptide 
Combinatorial Libraries for the Determination of Peptide Ligands in Radio- 
Receptor Assays-Opioid-Peptides, Bioorg. Med. Chem. Lett. 3:405-412 

10 (1993); Houghten, et al., Generation and Use of Synthetic Peptide 

Combinatorial Libraries for Basic Research and Drug Discovery, Nature 
354:84-86 (1991); Huang, et al., Discovery of New Ligand Binding 
Pathways in Myoglobin by Random Mutagenesis, Nature Struct. Biol. 
7:226-229 (1994); Huse, et al., Generation of a Large Combinatorial 

15 Library of the Immunoglobulin Repertoire In Phage Lambda, Science 

246:1275-1281 (1989); Janda, K.D., New Strategies for the Design of 
Catalytic Antibodies, Biotechnol. Prog. 6:178-181 (1990); Jung, eta/., 
Multiple Peptide Synthesis Methods and Their Applications, Angew. Chem. 
Int. Ed. Engl. 37:367-486 (1992); Kang, eta/., Linkage of Recognition and 

20 Replication Functions By Assembling Combinatorial Antibody Fab Libraries 
Along Phage Surfaces, Proc. Natl. Acad. Sci. U.S.A. 33:4363-4366 
(1991a); Kang, et al., Antibody Redesign by Chain Shuffling from Random 
Combinatorial Immunoglobulin Libraries, Proc. Natl. Acad. Sci. U.S.A., 
35:1 1 120-1 1 123 (1991b); Kay, eta/., An M13 Phage Library Displaying 

25 Random 38-Amino-Acid-Peptides as a Source of Novel Sequences with 

Affinity to Selected Targets Genes, Gene 723:59-65 (1993); Lam, et a/., A 
new type of synthetic peptide library for identifying ligand-binding activity, 
Nature 354:82-84 (1991) (published errata in Nature 355:434 (1992) and 
Nature 360:768 (1992); Lebl, et a/., One Bead One Structure Combinatorial 

30 Libraries, Biopolymers (Pept. Sci.) 37:177-198 (1995); Lerner, eta/., 

Antibodies without Immunization, Science 253:1313-1314 (1992); Li, et 
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aL, Minimization of a Polypeptide Hormone, Science 270:1657-1660 
(1995); Light, etaL, Display of Dimeric Bacterial Alkaline Phosphatase on 
the Major Coat Protein of Filamentous Bacteriophage, Bioorg. Med. Chem. 
Lett. 3:1073-1079 (1992); Little, et aL, Bacterial Surface Presentation of 
5 Proteins and Peptides: An Alternative to Phage Technology, Trends 

Biotechnol. 7 7:3-5 (1993); Marks, eta/., By-Passing Immunization. Human 
Antibodies from V-Gene Libraries Displayed on Phage, J. Mol. Biol., 
222:581-597 (1991); Matthews, et al. f Substrate Phage: Selection of 
Protease Substrates by Monovalent Phage Display, Science 260:1 113- 

10 1117 (1993); McCafferty, et aL, Phage Enzymes: Expression and Affinity 
Chromatography of Functional Alkaline Phosphatase on the Surface of 
Bacteriophage, Protein Eng. 4:955-961 (1991); Menger, et aL, 
Phosphatase Catalysis Developed Via Combinatorial Organic Chemistry, J. 
Org. Chem. 60:6666-6667 (1995); Nicolaou, et aL, Angew. Chem. Int. Ed. 

15 Engl. 34:2289-2291 (1995); Oldenburg, et aL, Peptide Ligands for A 

Sugar-Binding Protein Isolated from a Random Peptide Library, Proc. Natl. 
Acad. Sci. U.S.A. 55:5393-5397 (1992); Parmley, et aL, Antibody- 
Selectable Filamentous fd Phage Vectors: Affinity Purification of Target 
Genes, Genes 73:305-318 (1988); Pinilla, et aL, Synthetic Peptide 

20 Combinatorial Libraries (SPCLS)-ldentification of the Antigenic Determinant 
of Beta-Endorphin Recognized by Monoclonal Antibody-3E7, Gene 128:71- 
76 (1993); Pinilla, et aL, Review of the Utility of Soluble Combinatorial 
Libraries, Biopolymers 37:221-240 (1995); Pistor, et aL, Expression of 
Viral Hemaegglutinin On the Surface of E. Co/L, Klin. Wochenschr. 66:1 10- 

25 116 (1989); Pollack, et aL, Selective Chemical Catalysis by an Antibody, 
Science 234:1570-1572 (1986); Rigler, etaL, Fluorescence Correlations, 
Single Molecule Detection and Large Number Screening: Applications in 
Biotechnology, J. Biotechnol. 4 7:177-186 (1995); Sarvetnick, etaL, 
Increasing the Chemical Potential of the Germ-Line Antibody Repertoire, 

30 Proc. Natl. Acad. Sci. U.S.A. 50:4008-4011 (1993); Sastry, etaL, Cloning 
of the Immunological Repertiore in Escherichia Co/i for Generation of 
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Monoclonal Catalytic Antibodies: Construction of a Heavy Chain Variable 
Region-Specific cDNA Library, Proc. Natl. Acad. Sci. U.S.A. 56:5728-5732 
(1989); Scott, et al., Searching for Peptide Ligands with an Epitope Library, 
Science 243:386-390 (1990); Sears, et al., Engineering Enzymes for 
5 Bioorganic Synthesis: Peptide Bond Formation, Biotechnol. Prog. 72:423- 
433 (1996); Simon, et. al., Peptides: A Modular Approach to Drug 
Discovery, Proc. Natl. Acad. Sci. U.S.A. 53:9367-9371 (1992); Still, et al., 
Discovery of Sequence-Selective Peptide Binding by Synthetic Receptors 
Using Encoded Combinatorial Libraries, Acc. Chem. Res. 23:155-163 

10 (1996); Thompson, et al., Synthesis and Applications of Small Molecule 
Libraries, Chem. Rev. 36:555-600 (1996); Tramontano, et al., Catalytic 
Antibodies, Science 254:1566-1570 (1986); Wrighton, et al., Small 
Peptides as Potent Mimetics of the Protein Hormone Erythropoietin, 
Science 273:458-464 (1996); York, et al., Combinatorial mutagenesis of 

15 the reactive site region in plasminogen activator inhibitor I, J. Biol. Chem. 
255:8595-8600 (1991); Zebedee, et al., Human Combinatorial Antibody 
Libraries to Hepatitis B Surface Antigen, Proc. Natl. Acad. Sci. U.S.A. 
53:3175-3179 (1992); Zuckerman, et al., Identification of Highest-Affinity 
Ligands by Affinity Selection from Equimolar Peptide Mixtures Generated 

20 by Robotic Synthesis, Proc. Natl. Acad. Sci. U.S.A. 53:4505-4509 (1992). 
For example, peptides that bind to a CVSP16 polypeptide or a 
protease domain of a SP protein can be identified using phage display 
libraries. In an exemplary embodiment, this method can include 
a) contacting phage from a phage library with the CVSP1 6 polypeptide or a 

25 protease domain thereof; (b) isolating phage that bind to the protein; and 
(c) determining the identity of at least one peptide coded by the isolated 
phage to identify a peptide that binds to a CVSP1 6 polypeptide. 
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H. Modulators of the activity of CVSP16 polypeptides 

Provided herein are compounds, identified by screening or produced 
using the CVSP16 polypeptide or protease domain in other screening 
methods, that modulate the activity of a CVSP1 6. These compounds act 
5 by directly interacting with the CVSP1 6 polypeptide or by altering 

transcription or translation thereof. Such molecules include, but are not 
limited to, antibodies that specifically react with a CVSP16 polypeptide, 
particularly with the protease domain thereof, antisense nucleic acids or 
double-stranded RNA (dsRNA) such as RNAi, including those that contain 
10 modified nucleic acids, that alter expression of the CVSP16 polypeptide, 
peptide mimetics and other such compounds. 
1 . Antibodies 

Antibodies, including polyclonal and monoclonal antibodies, that 
specifically bind to a CVSP16 polypeptide provided herein, including 

15 antibodies to single chain protease domains thereof, or to activated forms 
of a single-chain protease domain, or to full-length activaed or zymogen 
forms of the polypeptide or to other portions of a CVSP1 6 are provided. 
Generally, the antibody is a monoclonal antibody, and typically the 
antibody specifically binds to a protease domain of the CVSP1 6 

20 polypeptide. 

Provided are antibodies that specifically bind to any domain of 
CVSP16, and antibodies that specifically bind to two-chain and/or three- 
chain thereof. Also provided are antibodies that specifically bind to an 
active site or active site cleft of zymogen and activated forms. Neutralizing 

25 antibodies are also provided. 

The CVSP16 polypeptide and domains, fragments, homologs and 
derivatives thereof can be used as immunogens to generate antibodies that 
specifically bind CVSP1 6 polypeptides and portions thereof. Such 
antibodies include but are not limited to polyclonal, monoclonal, chimeric, 

30 single chain, Fab fragments, and a Fab expression library. In a specific 
embodiment, antibodies to human CVSP16 polypeptide are produced. In 
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another embodiment, complexes formed from fragments of CVSP16 
polypeptide, which fragments contain the serine protease domain, are used 
as immunogens for antibody production. 

Antibodies provided herein include, but are not limited to, 
5 monoclonal and polyclonal antibodies. They include antibodies that inhibits 
catalytic activity of a CVSP1 6 polypeptide provided herein and/or a ligand 
or substrate binding activity of the polypeptide. Also included are 
antibodies that specifically bind to a single-chain protease domain 1 (PD1) 
of a CVSP16 polypeptide and/or two-chain PD1 and antibodies and 

10 antibodies that specifically bind to a single-chain protease domain 1 (PD2) 
of a CVSP16 polypeptide and/or two chain PD2. Included are antibodies 
that specifically that bind to a single-chain form and/or to two-chain and/or 
three-chain forms of a CVSP16 polypeptide. 

Various procedures known in the art can be used for the production 

15 of polyclonal antibodies to CVSP16 polypeptide, its domains, derivatives, 
fragments or analogs. For production of the antibody, various host animals 
can be immunized by injection with the native CVSP16 polypeptide or a 
synthetic version, or a derivative of the foregoing, such as a cross-linked 
CVSP16 polypeptide. Such host animals include but are not limited to 

20 rabbits, mice, rats, chickens and other animals. Various adjuvants can be 
used to increase the immunological response, depending on the host 
species, and include but are not limited to Freund's (complete and 
incomplete), mineral gels such as aluminum hydroxide, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 

25 emulsions, dinitrophenol, and potentially useful human adjuvants such as 
bacille Calmette-Guerin (BCG) and corynebacterium parvum. 

For preparation of monoclonal antibodies directed towards a CVSP16 
polypeptide or domains, derivatives, fragments or analogs thereof, any 
technique that provides for the production of antibody molecules by 

30 continuous cell lines in culture can be used. Such techniques include but 
are not restricted to the hybridoma technique originally developed by 
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Kohler and Milstein (Nature 256:495-497 (1975)), the trioma technique, 
the human B-cell hybridoma technique (Kozbor et a/., Immunology Today 
4:72 (1983)), and the EBV hybridoma technique to produce human 
monoclonal antibodies (Cole et a/., in Monoclonal Antibodies and Cancer 
5 Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)). For example, immortalized 
cell lines that secrete the desired monoclonal antibodies are provided. The 
immortalized cell lines secreting the desired antibodies are screened by 
immunoassays in which the antigen is the peptide hapten, polypeptide or 
protein. When the appropriate immortalized cell culture secreting the 

10 desired antibody is identified, the cells can be cultured either in vitro or by 
production in vivo via ascites fluid. 

Monoclonal antibodies can be produced by other methods, such as 
in germ-free animals utilizing recent technology (PCT/US90/02545). 
Human antibodies can be used and can be obtained by using human 

15 hybridomas (Cote eta/., Proc. Natl. Acad. Sci. USA 50:2026-2030 

(1983)), or by transforming human B cells with EBV virus in vitro (Cole et 
aL, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96 (1985)). Techniques developed for the production of "chimeric 
antibodies" (Morrison eta/., Proc. Natl. Acad. Sci. USA 81:6851-6855 

20 (1984); Neuberger eta/., Nature 372:604-608 (1984); Takeda eta/., 

Nature 374:452-454 (1985)) by splicing the genes from a mouse antibody 
molecule specific for the CVSP16 polypeptide together with genes from a 
human antibody molecule of appropriate biological activity can be used. 
Techniques described for the production of single chain antibodies 

25 (U.S. patent 4,946,778) can be adapted to produce CVSP16 polypeptide- 
specific single chain antibodies. An additional embodiment uses the 
techniques described for the construction of Fab expression libraries (Huse 
eta/., Science 246:1275-1281 (1989)) to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity for 

30 CVSP16 polypeptide or domains, derivatives, or analogs thereof. Non- 
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human antibodies can be "humanized" by known methods {see, e.g., U.S. 
Patent No. 5,225,539). 

Antibody fragments that specifically bind to CVSP16 polypeptide or 
epitopes thereof can be generated by techniques known in the art. For 
5 example, such fragments include but are not limited to: the F(ab')2 
fragment, which can be produced by pepsin digestion of the antibody 
molecule; the Fab' fragments that can be generated by reducing the 
disulfide bridges of the F(ab')2 fragment; the Fab fragments that can be 
generated by treating the antibody molecule with papain and a reducing 

10 agent; and Fv fragments. 

In the production of antibodies, screening for the desired antibody 
can be accomplished by techniques known in the art, e.g., ELISA (enzyme- 
linked immunosorbent assay). To select antibodies specific to a particular 
domain of the CVSP1 6 polypeptide one can assay generated hybridomas 

15 for a product that binds to the fragment of the CVSP1 6 polypeptide that 
contains such a domain. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and/or quantitation of CVSP16 polypeptide 
proteins, e.g., for imaging these proteins, measuring levels thereof in 

20 appropriate physiological samples, in, for example, diagnostic methods. In 
another embodiment, anti-CVSP16 polypeptide antibodies, or fragments 
thereof, containing the binding domain are used as therapeutic agents. 
2. Peptides, Polypeptides and Peptide Mimetics 
Provided herein are methods for identifying molecules that bind to 

25 and modulate the activity of SP proteins. Included among molecules that 
bind to SPs, particularly the single chain protease domain or catalytically 
active fragments thereof, are peptides, polypeptides and peptide mimetics, 
including cyclic peptides. Peptide mimetics are molecules or compounds 
that mimic the necessary molecular conformation of a ligand or polypeptide 

30 for specific binding to a target molecule such as a CVSP16 polypeptide. In 
an exemplary embodiment, the peptides, polypeptides or peptide mimetics 
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bind to the protease domain of the CVSP16 polypeptide. Such peptides 
and peptide mimetics include those of antibodies that specifically bind to a 
CVSP16 polypeptide and, typically, bind to the protease domain of a 
CVSP1 6 polypeptide. The peptides, polypeptides and peptide mimetics 
5 identified by methods provided herein can be agonists or antagonists of 
CVSP16 polypeptides. 

Such peptides and peptide mimetics are useful for diagnosing, 
treating, preventing, and screening for a disease or disorder associated 
with CVSP16 polypeptide activity in a mammal. In addition, the peptides 

10 and peptide mimetics are useful for identifying, isolating, and purifying 
molecules or compounds that modulate the activity of a CVSP16 
polypeptide, or specifically bind to a CVSP16 polypeptide, generally the 
protease domain of a CVSP16 polypeptide. Low molecular weight peptides 
and peptide mimetics can have strong binding properties to a target 

15 molecule, e.g. , a CVSP16 polypeptide and/or the protease domain of a 
CVSP16 polypeptide. 

Peptides, polypeptides and peptide mimetics that bind to CVSP16 
polypeptides as described herein can be administered to mammals, 
including humans, to modulate CVSP16 polypeptide activity. Thus, 

20 methods for therapeutic treatment and prevention of neoplastic diseases 
include administering a peptide, polypeptides or peptide mimetic compound 
in an amount sufficient to modulate such activity are provided. Thus, also 
provided herein are methods for treating a subject having such a disease or 
disorder in which a peptide, polypeptides or peptide mimetic compound is 

25 administered to the subject in a therapeutically effective dose or amount. 
Compositions containing the peptides, polypeptides or peptide 
mimetics can be administered for prophylactic and/or therapeutic 
treatments. In therapeutic applications, compositions can be administered 
to a patient already suffering from a disease, as described above, in an 

30 amount sufficient to cure or at least partially arrest the symptoms of the 
disease and its complications. Amounts effective for this use will depend 
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on the severity of the disease and the weight and general state of the 
patient and can be empirically determined. 

In prophylactic applications, compositions containing the peptides, 
polypeptides and peptide mimetics are administered to a patient susceptible 
5 to or otherwise at risk of a particular disease. Such an amount is defined to 
be a "prophylactically effective dose". In this use, the precise amounts 
again depend on the patient's state of health and weight. 
Accordingly, the peptides, polypeptides and peptide mimetics that bind to 
a CVSP1 6 polypeptide can be used to prepare pharmaceutical compositions 

10 containing, as an active ingredient, at least one of the peptides or peptide 
mimetics in association with a pharmaceutical carrier or diluent. The 
compounds can be administered, for example, by oral, pulmonary, parental 
(intramuscular, intraperitoneal, intravenous (IV) or subcutaneous injection), 
inhalation (via a fine powder formulation), transdermal, nasal, vaginal, 

15 rectal, or sublingual routes of administration and can be formulated in 
dosage forms appropriate for each route of administration (see, e.g., 
International PCT application Nos. WO 93/25221 and WO 94/17784; and 
European Patent Application 613,683). 

Peptides, polypeptides and peptide mimetics that bind to CVSP16 

20 polypeptides are useful in vitro as tools for understanding the biological 
role of CVSP16 polypeptides, including the evaluation of the many factors 
thought to influence, and be influenced by, the production of CVSP16 
polypeptide. Such peptides, polypeptides and peptide mimetics also are 
useful in the development of other compounds that bind to and modulate 

25 the activity of a CVSP16 polypeptide, because such compounds provide 
important information on the relationship between structure and activity 
that should facilitate such development. 

The peptides, polypeptides and peptide mimetics are also useful as 
competitive binders in assays to screen for new CVSP1 6 polypeptides or 

30 CVSP16 polypeptide agonists. In such assay embodiments, the 

compounds can be used without modification or can be modified in a 
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variety of ways; for example, by labeling, such as covalently or 
non-covalently joining a moiety which directly or indirectly provides a 
detectable signal. In any of these assays, the materials thereto can be 
labeled either directly or indirectly. Possibilities for direct labeling include 
5 label groups such as: radiolabels such as 125 l enzymes (U.S. Pat. No. 

3,645,090) such as peroxidase and alkaline phosphatase, and fluorescent 
labels (U.S. Pat. No. 3,940,475) capable of monitoring the change in 
fluorescence intensity, wavelength shift, or fluorescence polarization. 
Possibilities for indirect labeling include biotinylation of one constituent 

10 followed by binding to avidin coupled to one of the above label groups. The 
compounds also can include spacers or linkers in cases where the 
compounds are to be attached to a solid support. 

Moreover, based on their ability to bind to a CVSP1 6 polypeptide, 
the peptides, polypeptides and peptide mimetics can be used as reagents 

15 for detecting CVSP16 polypeptides in living cells, fixed cells, in biological 
fluids, in tissue homogenates and in purified, natural biological materials. 
For example, by labeling such peptides, polypeptides and peptide mimetics, 
cells having CVSP16 polypeptides can be identified. In addition, based on 
their ability to bind a CVSP16 polypeptide, the peptides, polypeptides and 

20 peptide mimetics can be used in in situ staining, FACS (fluores- 
cence-activated cell sorting), Western blotting, ELISA and other analytical 
protocols. Based on their ability to bind to a CVSP1 6 polypeptide, the 
peptides, polypeptides and peptide mimetics can be used in purification of 
CVSP16 polypeptides or in purifying cells expressing the CVSP16 

25 polypeptide, e.g., a polypeptide encoding the protease domain of a 
CVSP16 polypeptide. 

The peptides, polypeptides and peptide mimetics also can be used as 
commercial reagents for various medical research and diagnostic uses. The 
activity of the peptides and peptide mimetics can be evaluated either in 

30 vitro or in vivo in one of the numerous models described in McDonald 
(1992) Am. J. of Pediatric Hematology/Oncology, 74:8-21. 
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3. Peptide, polypeptides and peptide mimetic therapy 

Peptide analogs are commonly used in the pharmaceutical industry 
as non-peptide drugs with properties analogous to those of the template 
peptide. These types of non-peptide compounds are termed "peptide 
5 mimetics" or "peptidomimetics" (Luthman et al., A Textbook of Drug 
Design and Development, 14:386-406, 2nd Ed., Harwood Academic 
Publishers (1996); Joachim Grante (1994) Angew. Chem. Int. Ed. Engl., 
33:1699-1720; Fauchere (1986) J. Adv. Drug Res., 75:29; Veber and 
Freidinger (1985) TINS, p. 392; and Evans et al. (1987) J. Med. Chem. 

10 30:1229). Peptide mimetics that are structurally similar to therapeutically 
useful peptides can be used to produce an equivalent or enhanced 
therapeutic or prophylactic effect. Preparation of peptidomimetics and 
structures thereof are known to those of skill in this art. 

Systematic substitution of one or more amino acids of a consensus 

15 sequence with a D-amino acid of the same type {e.g., D-lysine in place of 
L-lysine) can be used to generate more stable peptides. In addition, 
constrained peptides containing a consensus sequence or a substantially 
identical consensus sequence variation can be generated by methods 
known in the art (Rizo et al. (1992) An. Rev. Biochem., 67:387, 

20 incorporated herein by reference); for example, by adding internal cysteine 
residues capable of forming intramolecular disulfide bridges which cyclize 
the peptide. 

Those skilled in the art appreciate that modifications can be made to 
the peptides and mimetics without deleteriously effecting the biological or 

25 functional activity of the peptide. Further, the skilled artisan would know 
how to design non-peptide structures in three dimensional terms, that 
mimic the peptides that bind to a target molecule, e.g., a CVSP16 
polypeptide or, generally, the protease domain of CVSP16 polypeptides 
(see, e.g., Eck and Sprang (1989) J. Biol. Chem., 26: 17605-18795). 

30 When used for diagnostic purposes, the peptides and peptide 

mimetics can be labeled with a detectable label and, accordingly, the 
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peptides and peptide mimetics without such a label can serve as 
intermediates in the preparation of labeled peptides and peptide mimetics. 
Detectable labels can be molecules or compounds, which when covalently 
attached to the peptides and peptide mimetics, permit detection of the 
5 peptide and peptide mimetics in vivo, for example, in a patient to whom 
the peptide or peptide mimetic has been administered, or in vitro, e.g., in a 
sample or cells. Suitable detectable labels are well known in the art and 
include, by way of example, radioisotopes, fluorescent labels (e.g., 
fluorescein), and the like. The particular detectable label employed is not 

10 critical and is selected to be detectable at non-toxic levels. Selection of 
the such labels is well within the skill of the art. 

Covalent attachment of a detectable label to the peptide or peptide 
mimetic is accomplished by conventional methods well known in the art. 
For example, when the 125 l radioisotope is employed as the detectable 

15 label, covalent attachment of 125 l to the peptide or the peptide mimetic can 
be achieved by incorporating the amino acid tyrosine into the peptide or 
peptide mimetic and then iodinating the peptide (see, e.g., Weaner et al. 
(1994) Synthesis and Applications of Isotopically Labelled Compounds, pp. 
137-140). If tyrosine is not present in the peptide or peptide mimetic, 

20 incorporation of tyrosine to the N or C terminus of the peptide or peptide 
mimetic can be achieved by well known chemistry. Likewise, 32 P can be 
incorporated onto the peptide or peptide mimetic as a phosphate moiety 
through, for example, a hydroxyl group on the peptide or peptide mimetic 
using conventional chemistry. 

25 Labeling of peptidomimetics usually involves covalent attachment of 

one or more labels, directly or through a spacer (e.g., an amide group), to 
non-interfering position(s) on the peptidomimetic that are predicted by 
quantitative structure-activity data and/or molecular modeling. Such 
non-interfering positions generally are positions that do not form direct 

30 contacts with the macromolecules(s) to which the peptidomimetic binds to 
produce the therapeutic effect. Derivatization (e.g. , labeling) of 
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peptidomimetics should not substantially interfere with the desired 
biological or pharmacological activity of the peptidomimetic. 

Peptides, polypeptides and peptide mimetics that can bind to a 
CVSP1 6 polypeptide or the protease domain of CVSP16 polypeptides 
5 and/or modulate the activity thereof, or exhibit CVSP16 protease activity, 
can be used for treatment of neoplastic disease. The peptides, 
polypeptides and peptide mimetics can be delivered, in vivo or ex vivo, to 
the cells of a subject in need of treatment. Further, peptides which have 
CVSP16 polypeptide activity can be delivered, in vivo or ex vivo, to cells 

10 which carry mutant or missing alleles encoding the CVSP1 6 polypeptide 
gene. Any of the techniques described herein or known to the skilled 
artisan can be used for preparation and in vivo or ex vivo delivery of such 
peptides, polypeptides and peptide mimetics that are substantially free of 
other human proteins. For example, the peptides, polypeptides and peptide 

15 mimetics can be readily prepared by expression in a microorganism or 
synthesis in vitro. 

The peptides or peptide mimetics can be introduced into cells, in 
vivo or ex vivo, by microinjection or by use of liposomes, for example. 
Alternatively, the peptides, polypeptides or peptide mimetics can be taken 

20 up by cells, in vivo or ex vivo, actively or by diffusion. In addition, 

extracellular application of the peptide, polypeptides or peptide mimetic can 
be sufficient to effect treatment of a neoplastic disease. Other molecules, 
such as drugs or organic compounds, that: 1) bind to a CVSP16 
polypeptide or protease domain thereof; or 2) have a similar function or 

25 activity to a CVSP1 6 polypeptide or protease domain thereof, can be used 
in methods for treatment. 

4. Rational drug design 

The goal of rational drug design is to produce structural analogs of 
biologically active polypeptides or peptides of interest or of small molecules 
30 or peptide mimetics with which they interact (e.g., agonists and 

antagonists) in order to fashion drugs which are, e.g., more active or stable 
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forms thereof; or which, for example, enhance or interfere with the 
function of a polypeptide in vivo (e.g., a CVSP16 polypeptide). In one 
approach, one first determines the three-dimensional structure of a protein 
of interest (e.g., a CVSP1 6 polypeptide or polypeptide having a protease 
5 domain) or, for example, of a CVSP16 polypeptide-ligand complex, by 
X-ray crystallography, by computer modeling or most typically, by a 
combination of approaches. Also, useful information regarding the 
structure of a polypeptide can be gained by modeling based on the 
structure of homologous proteins. In addition, peptides can be analyzed by 

10 an alanine scan. In this technique, an amino acid residue is replaced by 
Ala, and its effect on the peptide's activity is determined. Each of the 
amino acid residues of the peptide is analyzed in this manner to determine 
the important regions of the peptide. 

Also, a polypeptide or peptide that binds to a CVSP16 polypeptide 

15 or, generally, the protease domain of a CVSP16 polypeptide, can be 
selected by a functional assay, and then the crystal structure of this 
polypeptide or peptide can be determined. The polypeptide can be, for 
example, an antibody specific for a CVSP16 polypeptide and/or the 
protease domain of a CVSP16 polypeptide. This approach can yield a 

20 pharmacophore upon which subsequent drug design can be based. 
Further, it is possible to bypass the crystallography altogether by 
generating anti-idiotypic polypeptides or peptides, (anti-ids) to a functional, 
pharmacologically active polypeptide or peptide that binds to a CVSP16 
polypeptide or protease domain of a CVSP16 polypeptide. As a mirror 

25 image of a mirror image, the binding site of the anti-ids is expected to be 
an analog of the original target molecule, e.g., a CVSP1 6 polypeptide or 
polypeptide having a CVSP16 polypeptide. The anti-id could then be used 
to identify and isolate peptides from banks of chemically or biologically 
produced banks of peptides. Selected peptides would then act as the 

30 pharmacophore. 
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Thus, one can design drugs which have, e.g., improved activity or 

stability or which act as modulators (e.g., inhibitors, agonists, antagonists) 

of CVSP1 6 polypeptide activity, and are useful in the methods, particularly 

the methods for diagnosis, treatment, prevention, and screening of a 

5 neoplastic disease. By virtue of the availability of cloned CVSP1 6 

polypeptide sequences, sufficient amounts of the CVSP16 polypeptide can 

be made available to perform such analytical studies as X-ray 

crystallography. In addition, the knowledge of the amino acid sequence of 

a CVSP16 polypeptide or the protease domain thereof, e.g., the protease 

10 domain encoded by the nucleotide sequence of SEQ ID No. 6, can provide 

guidance on computer modeling techniques in place of, or in addition to, 

X-ray crystallography. 

Methods of identifying peptides and peptide mimettcs that 
bind to CVSP16 polypeptides 

15 Peptides having a binding affinity to the CVSP16 polypeptide are 

provided herein (e.g., a CVSP16 polypeptide or a polypeptide having a 
protease domain of a CVSP16 polypeptide) and can be readily identified, 
for example, by random peptide diversity generating systems coupled with 
an affinity enrichment process. Specifically, random peptide diversity 

20 generating systems include the "peptides on plasmids" system (see, e.g., 
U.S. Patent Nos. 5,270,170 and 5,338,665); the "peptides on phage" 
system (see, e.g., U.S. Patent No. 6,121,238 and Cwirla, eta/. (1990) 
Proc. Natl. Acad. Sci. U.S.A. 57:6378-6382); the "polysome system;" the 
"encoded synthetic library (ESL)" system; and the "very large scale 

25 immobilized polymer synthesis" system (see, e.g., U.S. Patent No. 

6,121,238; and Dower et al. (1991) An. Rep. Med. Chem. 26:271-280). 

For example, using the procedures described above, random 
peptides can generally be designed to have a defined number of amino acid 
residues in length (e.g., 12). To generate the collection of oligonucleotides 

30 encoding the random peptides, the codon motif (NNK)x, where N is 
nucleotide A, C, G, or T (equimolar; depending on the methodology 
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employed, other nucleotides can be employed), K is G or T (equimolar), and 
x is an integer corresponding to the number of amino acids in the peptide 
(e.g., 12) and can be used to specify any one of the 32 possible codons 
resulting from the NNK motif: 1 for each of 1 2 amino acids, 2 for each of 5 
5 amino acids, 3 for each of 3 amino acids, and only one of the three stop 
codons. Thus, the NNK motif encodes all of the amino acids, encodes only 
one stop codon, and reduces codon bias. 

The random peptides can be presented, for example, either on the 
surface of a phage particle, as part of a fusion protein containing either the 

10 pill or the p VII I coat protein of a phage fd derivative (peptides on phage) or 
as a fusion protein with the Lacl peptide fusion protein bound to a plasmid 
(peptides on plasmids). The phage or plasmids, including the DNA 
encoding the peptides, can be identified and isolated by an affinity 
enrichment process using immobilized CVSP16 polypeptide having a 

15 protease domain. The affinity enrichment process, sometimes called 
"panning," typically involves multiple rounds of incubating the phage, 
plasmids, or polysomes with the immobilized CVSP16 polypeptide, 
collecting the phage, plasmids, or polysomes that bind to the CVSP16 
polypeptide (along with the accompanying DNA or mRNA), and producing 

20 more of the phage or plasmids (along with the accompanying Lacl-peptide 
fusion protein) collected. 

Characteristics of peptides and peptide mimetics 
Among the peptides, polypeptides and peptide mimetics for 
therapeutic application are those having molecular weights from about 250 

25 to about 8,000 daltons. If such peptides are oligomerized, dimerized 

and/or derivatized with a hydrophilic polymer (e.g., to increase the affinity 
and/or activity of the compounds), the molecular weights of such peptides 
can be substantially greater and can range anywhere from about 500 to 
about 120,000 daltons, generally from about 8,000 to about 80,000 

30 daltons. Such peptides can contain 9 or more amino acids that are 

naturally occurring or synthetic (non-naturally occurring) amino acids. One 
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skilled in the art can determine the affinity and molecular weight of the 
peptides and peptide mimetics suitable for therapeutic and/or diagnostic 
purposes (e.g., see Dower et a/., U.S. Patent No. 6,121,238). 

5. Methods of preparing peptides and peptide mimetics 
5 Peptides and peptide mimetics can be designed, using a variety of 

methods, such as, for example, the "encoded synthetic library" or "very 
large scale immobilized polymer synthesis" systems (see, e.g., U.S. Patent 
Nos. 5,925,525 and 5,902,723). Using the "encoded synthetic library" or 
"very large scale immobilized polymer synthesis" systems, the minimum 

10 size of a peptide with an activity of interest, such as binding to a CVSP16, 
can be determined. In addition all peptides that form the group of peptides 
that differ from the desired motif (or the minimum size of that motif) in 
one, two, or more residues can be prepared. This collection of peptides 
then can be screened for an ability to bind to a target molecule, e.g., a 

15 CVSP16 polypeptide or, generally, the protease domain of a CVSP16 

polypeptide. This immobilized polymer synthesis system or other peptide 
synthesis methods also can be used to synthesize truncation analogs and 
deletion analogs and combinations of truncation and deletion analogs of 
the peptide compounds. 

20 Peptides that bind to CVSP16 polypeptides can be prepared by 

classical methods known in the art, for example, by using standard solid 
phase techniques. The standard methods include exclusive solid phase 
synthesis, partial solid phase synthesis methods, fragment condensation, 
classical solution synthesis, and even by recombinant DNA technology 

25 (see, e.g., Merrifield (1963) J. Am. Chem. Soc, 55:2149, incorporated 
herein by reference.) 

These procedures also can be used to synthesize peptides in which 
amino acids other than the 20 naturally occurring, genetically encoded 
amino acids are substituted at one, two, or more positions of the peptide. 

30 For instance, naphthylalanine can be substituted for tryptophan, facilitating 
synthesis. Other synthetic amino acids that can be substituted into the 
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peptides include L-hydroxypropyl, L-3, 4-dihydroxy-phenylalanyl, d amino 
acids such as L-d-hydroxylysyl and D-d-methylalanyl, L-a-methylalanyl, /? 
amino acids, and isoquinolyl. D amino acids and non-naturally occurring 
synthetic amino acids also can be incorporated into the peptides (see, e.g., 
5 Roberts et al. (1 983) Unusual Amino/Acids in Peptide Synthesis, 
5(6):341-449). 

The peptides also can be modified by phosphorylation (see, e.g., W. 
Bannwarth et al. (1 996) Biorganic and Medicinal Chemistry Letters, 
6(1 7):2141-2146), and other methods for making peptide derivatives (see, 

10 e.g., Hruby et al. (1990) Biochem. J., 268(2) :249-262). Thus, peptide 
compounds also serve as a basis to prepare peptide mimetics with similar 
biological activity. 

Those of skill in the art recognize that a variety of techniques are 
available for constructing peptide mimetics with the same or similar desired 

15 biological activity as the corresponding peptide compound but with more 
favorable activity than the peptide with respect to solubility, stability, and 
susceptibility to hydrolysis and proteolysis (see, e.g. , Morgan et al. (1989) 
An. Rep. Med. Chem., 24:243-252). Methods for preparing peptide 
mimetics modified at the N-terminal amino group, the C-terminal carboxyl 

20 group, and/or changing one or more of the amido linkages in the peptide to 
a non-amido linkage are known to those of skill in the art. 

Amino terminus modifications include, but are not limited to, 
alkylating, acetylating and adding a carbobenzoyl group, forming a 
succinimide group (see, e.g., Murray et al. (1995) Burger's Medicinal 

25 Chemistry and Drug Discovery, 5th ed.. Vol. 1, Manfred E. Wolf, ed., John 
Wiley and Sons, Inc.). C-terminal modifications include mimetics wherein 
the C-terminal carboxyl group is replaced by an ester, an amide or 
modifications to form a cyclic peptide. 

In addition to N-terminal and C-terminal modifications, the peptide 

30 compounds, including peptide mimetics, advantageously can be modified 
with or covalently coupled to one or more of a variety of hydrophilic 
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polymers. It has been found that when peptide compounds are derivatized 
with a hydrophilic polymer, their solubility and circulation half-lives can be 
increased and their immunogenicity is masked, with little, if any, 
diminishment in their binding activity. Suitable nonproteinaceous polymers 
5 include, but are not limited to, polyalkylethers as exemplified by 

polyethylene glycol and polypropylene glycol, polylactic acid, polyglycolic 
acid, polyoxyalkenes, polyvinylalcohol, polyvinylpyrrolidone, cellulose and 
cellulose derivatives, dextran and dextran derivatives. Generally, such 
hydrophilic polymers have an average molecular weight ranging from about 

10 500 to about 100,000 daltons, including from about 2,000 to about 
40,000 daltons and, from about 5,000 to about 20,000 daltons. The 
hydrophilic polymers can have average molecular weights of about 5,000 
daltons, 10,000 daltons and 20,000 daltons. The peptide compounds can 
be dimerized and each of the dimeric subunits can be covalently attached 

15 to a hydrophilic polymer. The peptide compounds can be PEGylated, i.e., 
covalently attached to polyethylene glycol (PEG). 

Methods for derivatizing peptide compounds or for coupling peptides 
to such polymers have been described (see, e.g., Zallipsky (1995) 
Bioconjugate Chem., 6:150-165; Monfardini et al. (1995) Bioconjugate 

20 Chem., 6:62-69; U.S. Pat. No. 4,640,835; U.S. Pat. No. 4,496,689; U.S. 
Pat. No. 4,301,144; U.S. Pat. No. 4,670,417; U.S. Pat. No. 4,791,192; 
U.S. Pat. No. 4,179,337 and WO 95/34326, all of which are incorporated 
by reference in their entirety herein). 

Other methods for making peptide derivatives are described, for 

25 example, in Hruby et al. (1990), Biochem J., 265(2) :249-262, which is 

incorporated herein by reference. Thus, the peptide compounds also serve 
as structural models for non-peptidic compounds with similar biological 
activity. Those of skill in the art recognize that a variety of techniques are 
available for constructing compounds with the same or similar desired 

30 biological activity as a particular peptide compound but with more 

favorable activity with respect to solubility, stability, and susceptibility to 
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hydrolysis and proteolysis (see, e.g., Morgan et al. (1989) An. Rep. Med. 
Chem., 24:243-252, incorporated herein by reference). These techniques 
include replacing the peptide backbone with a backbone composed of 
phosphonates, amidates, carbamates, sulfonamides, secondary amines, 
5 and N-methylamino acids. 

Peptide compounds can exist in a cyclized form with an 
intramolecular disulfide bond between the thiol groups of the cysteines. 
Alternatively, an intermolecular disulfide bond between the thiol groups of 
the cysteines can be produced to yield a dimeric (or higher oligomeric) 

10 compound. One or more of the cysteine residues also can be substituted 
with a homocysteine. 
I. Conjugates 

A conjugate, containing: a) a single chain protease domain (or 
proteolytically active portion thereof) of a CVSP1 6 polypeptide or a full 

15 length zymogen, activated form thereof, or two or single chain protease 
domain thereof; and b) a targeting agent linked to the CVSP1 6 polypeptide 
directly or via a linker, wherein the agent facilitates: i) affinity isolation or 
purification of the conjugate; ii) attachment of the conjugate to a surface; 
iii) detection of the conjugate; or iv) targeted delivery to a selected tissue 

20 or cell, is provided herein. The conjugate can be a chemical conjugate or a 
fusion protein mixture thereof. 

The targeting agent can be a protein or peptide fragment, such as a 
tissue specific or tumor specific monoclonal antibody or growth factor or 
fragment thereof linked either directly or via a linker to a CVSP16 

25 polypeptide or a protease domain thereof. The targeting agent also can be 
a protein or peptide fragment that contains a protein binding sequence, a 
nucleic acid binding sequence, a lipid binding sequence, a polysaccharide 
binding sequence, or a metal binding sequence, or a linker for attachment 
to a solid support. In a particular embodiment, the conjugate contains a) 

30 the CVSP16 or portion thereof, as described herein; and b) a targeting 
agent linked to the CVSP16 polypeptide directly or via a linker. 
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Conjugates, such as fusion proteins and chemical conjugates, of the 
CVSP16 polypeptide with a protein or peptide fragment (or plurality 
thereof) that functions, for example, to facilitate affinity isolation or 
purification of the CVSP1 6 polypeptide domain, attachment of the CVSP1 6 
5 polypeptide domain to a surface, or detection of the CVSP16 polypeptide 
domain are provided. The conjugates can be produced by chemical 
conjugation, such as via thiol linkages, and can be produced by 
recombinant means as fusion proteins. In the fusion protein, the peptide or 
fragment thereof is linked to either the N-terminus or C-terminus of the 

10 CVSP16 polypeptide domain. In chemical conjugates the peptide or 

fragment thereof can be linked anywhere that conjugation can be effected, 
and there can be a plurality of such peptides or fragments linked to a single 
CVSP16 polypeptide domain or to a plurality thereof. 

The targeting agent is for in vitro or in vivo delivery to a cell or 

15 tissue, and includes agents such as cell or tissue-specific antibodies, 
growth factors and other factors that bind to moieties expressed on 
specific cells; and other cell or tissue specific agents that promote directed 
delivery of a linked protein. The targeting agent can be one that 
specifically delivers the CVSP16 polypeptide to selected cells by interaction 

20 with a cell surface protein and internalization of conjugate or CVSP16 
polypeptide portion thereof. 

These conjugates are used in a variety of methods and are 
particularly suited for use in methods of activation of prodrugs, such as 
prodrugs that upon cleavage by the particular CVSP16, which is localized 

25 at or near the targeted cell or tissue, are cytotoxic. The prodrugs are 
administered prior to, or simultaneously with, or subsequently to the 
conjugate. Upon delivery to the targeted cells, the protease activates the 
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prodrug, which then exhibits a therapeutic effect, such as a cytotoxic 
effect. 

1 . Conjugation 

Conjugates with linked CVSP16 polypeptides and/or domains thereof 
5 can be prepared either by chemical conjugation, recombinant DNA 

technology, or by combinations of recombinant expression and chemical 
conjugation. The CVSP16 polypeptide domains and the targeting agent 
can be linked in any orientation and more than one targeting agents and/or 
CVSP16 polypeptide domains can be present in a conjugate. 
10 a. Fusion proteins 

Fusion proteins are provided herein. A fusion protein contains: 
a) one or a plurality of domains of a CVSP16 polypeptide; and b) a 
targeting agent. The fusion proteins are generally produced by 
recombinant expression of nucleic acids that encode the fusion protein. 
15 b. Chemical conjugation 

To effect chemical conjugation herein, the CVSP16 polypeptide 
domain is linked via one or more selected linkers or directly to the targeting 
agent. Chemical conjugation must be used if the targeted agent is other 
than a peptide or protein, such as a nucleic acid or a non-peptide drug. 
20 Any means known to those of skill in the art for chemically conjugating 
selected moieties can be used. 

2. Linkers 

Linkers for can be included in the conjugates. The conjugates can 
include one or more linkers between the CVSP16 polypeptide portion and 
25 the targeting agent. Additionally, linkers are used for facilitating or 

enhancing immobilization of a CVSP16 polypeptide or portion thereof on a 
solid support, such as a microtiter plate, silicon or silicon-coated chip, glass 
or plastic support, such as for high throughput solid phase screening 
protocols. Any linker known to those of skill in the art for preparation of 
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conjugates can be used herein. These linkers are typically used in the 
preparation of chemical conjugates; peptide linkers can be incorporated into 
fusion proteins. 

Linkers can be any moiety suitable to associate a domain of CVSP16 
5 polypeptide and a targeting agent. Such linkers and linkages include, but 
are not limited to, peptidic linkages, amino acid and peptide linkages, 
typically containing between one and about 60 amino acids, more generally 
between about 10 and 30 amino acids, chemical linkers, such as heterobi- 
functional cleavable cross-linkers, including but are not limited to, N- 

10 succinimidyl (4-iodoacetyl)-aminobenzoate, sulfosuccinimidyl (4- 

iodoacetyD-aminobenzoate, 4-succinimidyl-oxycarbonyl-a-(2-pyridyldithio)- 
toluene, sulfosuccinimidyl-6-[a-methyl-a-(pyridyldithiol)-toluamido] 
hexanoate, N-succinimidyl-3-(-2-pyridyldithio) - propionate, succinimidyl 
6[3(-(-2-pyridyldithio)-propionamido] hexanoate, sulfosuccinimidyl 6[3(-(-2- 

15 pyridyldithio)-propionamido] hexanoate, 3-(2-pyridyldithio)-propionyl 

hydrazide, Ellman's reagent, dichlorotriazinic acid, and S-(2-thiopyridyl)-L- 
cysteine. Other linkers include, but are not limited to peptides and other 
moieties that reduce steric hindrance between the domain of CVSP1 6 
polypeptide and the targeting agent, intracellular enzyme substrates, linkers 

20 that increase the flexibility of the conjugate, linkers that increase the 

solubility of the conjugate, linkers that increase the serum stability of the 
conjugate, photocleavable linkers and acid cleavable linkers. 

Other exemplary linkers and linkages that are suitable for chemically 
linked conjugates include, but are not limited to, disulfide bonds, thioether 

25 bonds, hindered disulfide bonds, and covalent bonds between free reactive 
groups, such as amine and thiol groups. These bonds are produced using 
heterobifunctional reagents to produce reactive thiol groups on one or both 
of the polypeptides and then reacting the thiol groups on one polypeptide 
with reactive thiol groups or amine groups to which reactive maleimido 

30 groups or thiol groups can be attached on the other. Other linkers include, 
acid cleavable linkers, such as bismaleimideothoxy propane, acid labile- 
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transferrin conjugates and adipic acid diihydrazide, that would be cleaved 
in more acidic intracellular compartments; cross linkers that are cleaved 
upon exposure to UV or visible light and linkers, such as the various 
domains, such as C H 1 , C H 2, and C H 3, from the constant region of human 
5 IgG! (see, Batra eta/. Molecular Immunol. 30:379-386 (1993)). In some 
embodiments, several linkers can be included in order to take advantage of 
desired properties of each linker. 

Chemical linkers and peptide linkers can be inserted by covalently 
coupling the linker to the domain of CVSP16 polypeptide and the targeting 

10 agent. The heterobifunctional agents, described below, can be used to 
effect such covalent coupling. Peptide linkers also can be linked by 
expressing DNA encoding the linker and therapeutic agent (TA), linker and 
targeted agent, or linker, targeted agent and therapeutic agent (TA) as a 
fusion protein. Flexible linkers and linkers that increase solubility of the 

15 conjugates are contemplated for use, either alone or with other linkers are 

also contemplated herein. 

a) Acid cleavable, photocleavable and heat sensitive 
linkers 

Acid cleavable linkers, photocleavable and heat sensitive linkers also 
20 can be used, particularly where it can be necessary to cleave the domain of 

CVSP16 polypeptide to permit it to be more readily accessible to reaction. 

Acid cleavable linkers include, but are not limited to, bismaleimideothoxy 

propane; and adipic acid dihydrazide linkers (see, e.g., Fattom et al. (1992) 

Infection & Immun. 60:584-589) and acid labile transferrin conjugates that 
25 contain a sufficient portion of transferrin to permit entry into the 

intracellular transferrin cycling pathway (see, e.g. , Welhoner et al. (1991) 

J. Biol. Chem. 266:4309-4314). 

Photocleavable linkers are linkers that are cleaved upon exposure to 

light (see, e.g., Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which 
30 linkers are herein incorporated by reference), thereby releasing the targeted 

agent upon exposure to light. Photocleavable linkers that are cleaved upon 
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exposure to light are known (see, e.g., Hazum et al. (1981) in Pept., Proc. 
Eur. Pept. Symp., 16th, Brunfeldt, K (Ed), pp. 105-110, which describes 
the use of a nitrobenzyl group as a photocleavable protective group for 
cysteine; Yen et al. (1989) Makromol. Chem 730:69-82, which describes 
5 water soluble photocleavable copolymers, including 

hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein 
copolymer and methylrhodamine copolymer; Goldmacheralso et al. (1992) 
Bioconj. Chem. 3:104-107, which describes a cross-linker and reagent that 
undergoes photolytic degradation upon exposure to near UV light (350 

10 nm); and Senter et al. (1985) Photochem. Photobiol 42:231-237 which 
describes nitrobenzyloxycarbonyl chloride cross linking reagents that 
produce photocleavable linkages), thereby releasing the targeted agent 
upon exposure to light. Such linkers would have particular use in treating 
dermatological or ophthalmic conditions that can be exposed to light using 

15 fiber optics. After administration of the conjugate, the eye or skin or other 
body part can be exposed to light, resulting in release of the targeted 
moiety from the conjugate. Such photocleavable linkers are useful in 
connection with diagnostic protocols in which it is desirable to remove the 
targeting agent to permit rapid clearance from the body of the animal. 

20 b) Other linkers for chemical conjugation 

Other linkers, include trityl linkers, particularly, derivatized 
trityl groups to generate a genus of conjugates that provide for 
release of therapeutic agents at various degrees of acidity or alkalinity. 
The flexibility thus afforded by the ability to preselect the pH range at 

25 which the therapeutic agent is released allows selection of a linker based 
on the known physiological differences between tissues in need of delivery 
of a therapeutic agent (see, e.g., U.S. Patent No. 5,612,474). For 
example, the acidity of tumor tissues appears to be lower than that of 
normal tissues. 
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c) Peptide linkers 

The linker moieties can be peptides. Peptide linkers can be 
employed in fusion proteins and also in chemically linked conjugates. The 
peptide typically has from about 2 to about 60 amino acid residues, for 
5 example from about 5 to about 40, or from about 10 to about 30 amino 
acid residues. The length selected depends upon factors, such as the use 
for which the linker is included. 

Peptide linkers are advantageous when the targeting agent is 
proteinaceous. For example, the linker moiety can be a flexible spacer 

10 amino acid sequence, such as those known in single-chain antibody 

research. Examples of such known linker moieties include, but are not 
limited to, peptides, such as (Gly m Ser) n and (Ser m Gly) n , in which n is 1 to 
6, including 1 to 4 and 2 to 4, and m is 1 to 6, including 1 to 4, and 2 to 
4, enzyme cleavable linkers and others. 

15 Additional linking moieties are known. See, for example, Huston et 

aL, Proc. Natl. Acad. Sci. U.S.A. 55:5879-5883, 1988; Whitlow, M., et 
a/., Protein Engineering 6:989-995, 1993; Newton et aL, Biochemistry 
35:545-553, 1996; A. J. Cumber et aL, Bioconj. Chem. 3:397-401, 1992; 
Ladurner et aL, J. Mol. Biol. 273:330-337, 1997; and U.S. Patent No. 

20 4,894,443. In some embodiments, several linkers can be included in order 
to take advantage of desired properties of each linker. 
3. Targeting agents 

Any agent that facilitates detection, immobilization, or purification of 
the conjugate is contemplated for use herein. For chemical conjugates any 

25 moiety that has such properties is contemplated; for fusion proteins, the 

targeting agent is a protein, peptide or fragment thereof that is sufficient to 
effect the targeting activity. Contemplated targeting agents include those 
that deliver the CVSP1 6 polypeptide or portion thereof to selected cells and 
tissues. Such agents include tumor specific monoclonal antibodies and 

30 portions thereof, growth factors, such as FGF, EGF, PDGF, VEGF, 
cytokines, including chemokines, and other such agents. 
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4. Nucleic acids, plasmids and cells 

Isolated nucleic acid fragments encoding fusion proteins are 
provided. The nucleic acid fragment that encodes the fusion protein 
includes: a) nucleic acid encoding a protease domain of a CVSP1 6 
5 polypeptide; and b) nucleic acid encoding a protein, peptide or effective 
fragment thereof that facilitates: i) affinity isolation or purification of the 
fusion protein; ii) attachment of the fusion protein to a surface; or iii) 
detection of the fusion protein. Generally, the nucleic acid is DNA. 

Plasmids for replication and vectors for expression that contain the 

10 above nucleic acid fragments are also provided. Cells containing the 

plasmids and vectors are also provided. The cells can be any suitable host 
including, but are not limited to, bacterial cells, yeast cells, fungal cells, 
plant cells, insect cell and animal cells. The nucleic acids, plasmids, and 
cells containing the plasmids can be prepared according to methods known 

15 in the art including any described herein. 

Also provided are methods for producing the above fusion proteins. 
An exemplary method includes the steps of growing cells (i.e., culturing 
the cells so that they proliferate) containing a plasmid encoding the fusion 
protein under conditions whereby the fusion protein is expressed by the 

20 cell, and recovering the expressed fusion protein. Methods for expressing 
and recovering recombinant proteins are well known in the art (See 
generally, Current Protocols in Molecular Biology (1998) § 16, John Wiley 
& Sons, Inc.) and such methods can be used for expressing and recovering 
the expressed fusion proteins. 

25 The recovered fusion proteins can be isolated or purified by methods 

known in the art such as, for example, centrifugation, filtration, 
chromatography, electrophoresis and immunoprecipitation, or by a 
combination thereof (See generally, Current Protocols in Molecular Biology 
(1998) § 10, John Wiley & Sons, Inc.). Generally the recovered fusion 

30 protein is isolated or purified through affinity binding between the protein 
or peptide fragment of the fusion protein and an affinity binding moiety. 
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As discussed in the above sections regarding the construction of the fusion 
proteins, any affinity binding pairs can be constructed and used in the 
isolation or purification of the fusion proteins. For example, the affinity 
binding pairs can be protein binding sequences/protein, DNA binding 
5 sequences/DNA sequences, RNA binding sequences/RNA sequences, lipid 
binding sequences/lipid, polysaccharide binding sequences/polysaccharide, 
or metal binding sequences/metal. 

5. Immobilization and supports or substrates therefor 

In certain embodiments, where the targeting agents are designed for 

10 linkage to surfaces, the CVSP16 polypeptide can be attached by linkage 
such as ionic or covalent, non-covalent or other chemical interaction, to a 
surface of a support or matrix material. Immobilization can be effected 
directly or via a linker. The CVSP16 polypeptide can be immobilized on 
any suitable support, including, but are not limited to, silicon chips, and 

15 other supports described herein and known to those of skill in the art. A 
plurality of CVSP16 polypeptide or protease domains thereof can be 
attached to a support, such as an array (i.e., a pattern of two or more) of 
conjugates on the surface of a silicon chip or other chip for use in high 
throughput protocols and formats. 

20 It also is noted that the domains of the CVSP1 6 polypeptide can be 

linked directly to the surface or via a linker without a targeting agent linked 
thereto. Hence chips containing arrays of the domains of the CVSP16 
polypeptide are also provided. 

The matrix material or solid supports contemplated herein are 

25 generally any of the insoluble materials known to those of skill in the art to 
immobilize ligands and other molecules, and are those that are used in 
many chemical syntheses and separations. Such supports are used, for 
example, in affinity chromatography, in the immobilization of biologically 
active materials, and during chemical syntheses of biomolecules, including 

30 proteins, amino acids and other organic molecules and polymers. The 
preparation of and use of supports is well known to those of skill in this 
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art; there are many such materials and preparations thereof known. For 
example, naturally-occurring support materials, such as agarose and 
cellulose, can be isolated from their respective sources, and processed 
according to known protocols, and synthetic materials can be prepared in 
5 accord with known protocols. 

The supports are typically insoluble materials that are solid, porous, 
deformable, or hard, and have any required structure and geometry, inclu- 
ding, but not limited to: beads, pellets, disks, capillaries, hollow fibers, 
needles, solid fibers, random shapes, thin films and membranes. Thus, the 

10 item can be fabricated from the matrix material or combined with it, such 
as by coating all or part of the surface or impregnating particles. 

Typically, when the matrix is particulate, the particles are at least 
about 10-2000 //m, but can be smaller or larger, depending upon the 
selected application. Selection of the matrices is governed, at least in part, 

15 by their physical and chemical properties, such as solubility, functional 

groups, mechanical stability, surface area swelling propensity, hydrophobic 
or hydrophilic properties and intended use. 

If necessary, the support matrix material can be treated to contain 
an appropriate reactive moiety. In some cases, the support matrix material 

20 already containing the reactive moiety can be obtained commercially. The 
support matrix material containing the reactive moiety can thereby serve as 
the matrix support upon which molecules are linked. Materials containing 
reactive surface moieties such as amino silane linkages, hydroxyl linkages 
or carboxysilane linkages can be produced by well established surface 

25 chemistry techniques involving silanization reactions, or the like. Examples 
of these materials are those having surface silicon oxide moieties, 
covalently linked to gamma-aminopropylsilane, and other organic moieties; 
N-[3-(triethyoxysilyl)propyl]phthalamic acid; and bis-(2-hydroxyethyl)amino- 
propyltriethoxysilane. Exemplary of readily available materials containing 

30 amino group reactive functionalities, include, but are not limited to, para- 
aminophenyltriethyoxysilane. Also derivatized polystyrenes and other such 
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polymers are well known and readily available to those of skill in this art 
(e.g. , the Tentagel® Resins are available with a multitude of functional 
groups, and are sold by Rapp Polymere, Tubingen, Germany; see, U.S. 
Patent No. 4,908,405 and U.S. Patent No. 5,292,814; see, also Butz et al. 
5 Peptide Res. 7:20-23 (1994); and Klein et al. Immunobiol. 750:53-66 
(1994)). 

These matrix materials include any material that can act as a support 
matrix for attachment of the molecules of interest. Such materials are 
known to those of skill in this art, and include those that are used as a 

10 support matrix. These materials include, but are not limited to, inorganics, 
natural polymers, and synthetic polymers, including, but are not limited to: 
cellulose, cellulose derivatives, acrylic resins, glass, silica gels, polystyrene, 
gelatin, polyvinyl pyrrolidone, co-polymers of vinyl and acrylamide, 
polystyrene cross-linked with divinylbenzene and others (see, Merrifield, 

15 Biochemistry 3:1385-1390 (1964)), polyacrylamides, latex gels, 
polystyrene, dextran, polyacrylamides, rubber, silicon, plastics, 
nitrocellulose, celluloses, natural sponges. Of particular interest herein, are 
highly porous glasses (see, e.g., U.S. Patent No. 4,244,721) and others 
prepared by mixing a borosilicate, alcohol and water. 

20 Synthetic supports include, but are not limited to: acrylamides, 

dextran-derivatives and dextran co-polymers, agarose-polyacrylamide 
blends, other polymers and co-polymers with various functional groups, 
methacrylate derivatives and co-polymers, polystyrene and polystyrene 
copolymers (see, e.g., Merrifield, Biochemistry 3:1 385-1 390 (1964); Berg 

25 et al. Innovation Perspect. Solid Phase Synth. Collect. Pap., Int. Symp., 
1st, Epton, Roger (Ed), pp. 453-459 (1990); Berg eta/., Pept. f Proc. Eur. 
Pept. Symp., 20th, Jung, G. et al. (Eds), pp. 196-198 (1989); Berg et al., 
J. Am. Chem. Soc. 7 7 7:8024-8026 (1989); Kent et al., /sr. J. Chem., 
17:243-247 (1979); Kent eta/.., J. Org. Chem. 43:2845-2852 (1978); 

30 Mitchell eta/., Tetrahedron Lett. 42:3795-3798 (1976); U.S. Patent No. 
4,507,230; U.S. Patent No. 4,006,1 17; and U.S. Patent No. 5,389,449). 
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Such materials include those made from polymers and co-polymers such as 
polyvinylalcohols, acrylates and acrylic acids such as poly- 
ethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethylene-co- 
ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co-acrylic 
5 acid, polypropylene-co-methyl-acrylic acid, polypropylene-co-ethylacrylate, 
polypropylene-co-methyl acrylate, polyethylene-co-vinyl acetate, poly- 
propylene-co-vinyl acetate, and those containing acid anhydride groups 
such as polyethylene-co-maleic anhydride and polypropylene-co-maleic 
anhydride. Liposomes have also been used as solid supports for affinity 

10 purifications (Powell et al. Biotechnol. Bioeng. 33:173 (1989)). 

Numerous methods have been developed for the immobilization of 
proteins and other biomolecules onto solid or liquid supports (see, e.g., 
Mosbach, Methods in Enzymology 44 (1976); Weetall, Immobilized 
Enzymes, Antigens, Antibodies, and Peptides, (1975); Kennedy eta/., Solid 

15 Phase Biochemistry, Analytical and Synthetic Aspects, Scouten, ed., pp. 
253-391 (1983); see, generally, Affinity Techniques. Enzyme Purification: 
Part B. Methods in Enzymology, Vol. 34, ed. W. B. Jakoby, M. Wilchek, 
Acad. Press, N.Y. (1974); and Immobilized Biochemicals and Affinity 
Chromatography, Advances in Experimental Medicine and Biology, vol. 42, 

20 ed. R. Dunlap, Plenum Press, N.Y. (1974)). 

Among the most commonly used methods are absorption and adsorp- 
tion or covalent binding to the support, either directly or via a linker, such 
as the numerous disulfide linkages, thioether bonds, hindered disulfide 
bonds, and covalent bonds between free reactive groups, such as amine 

25 and thiol groups, known to those of skill in art (see, e.g., the PIERCE 

CATALOG, ImmunoTechnology Catalog & Handbook, 1992-1993, which 
describes the preparation of and use of such reagents and provides a 
commercial source for such reagents; Wong, Chemistry of Protein 
Conjugation and Cross Linking, CRC Press (1993); see also DeWitt et al., 

30 Proc. Natl. Acad. Sci. U.S.A. 30:6909 (1993); Zuckermann eta/., J. Am. 
Chem. Soc. 7 74:10646 (1992); Kurth eta/., J. Am. Chem. Soc. 7 76:2661 
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(1994); Ellman et al., Proc. Natl. Acad. Sci. U.S.A. 57:4708 (1994); 
Sucholeiki, Tetrahedron Lttrs. 35:7307 (1994); Su-Sun Wang, J. Org. 
Chem. 47:3258 (1976); Padwa et al., J. Org. Chem. 47:3550 (1971); and 
Vedejs et a/., J. Org. Chem. 43:575 (1984), which describe photosensitive 
5 linkers). 

To effect immobilization, a composition containing the protein or 
other biomolecule is contacted with a support material such as alumina, 
carbon, an ion-exchange resin, cellulose, glass or a ceramic. Fluorocarbon 
polymers have been used as supports to which biomolecules have been 

10 attached by adsorption (see, U.S. Patent No. 3,843,443; Published 
International PCT Application WO/86 03840). 
J. Prognosis and diagnosis 

CVSP1 6 polypeptide proteins, domains, analogs, and derivatives 
thereof, and encoding nucleic acids (and sequences complementary 

15 thereto), and anti-CVSP16 polypeptide antibodies, can be used in 

diagnostics, particularly diagnosis of cervical cancer, colon and pancreatic 
cancers, and possibly other cancers, including prostate, colon, ovary, 
cervix and breast cancers. Such molecules can be used in assays, such as 
immunoassays, to detect, prognose, diagnose, or monitor various 

20 conditions, diseases, and disorders affecting CVSP16 polypeptide 

expression, or monitor the treatment thereof. For purposes herein, the 
presence of CVSP16s in body fluids or tumor tissues are of particular 
interest. 

In particular, such an immunoassay is carried out by a method 
25 including contacting a sample derived from a patient with an anti-CVSP16 
polypeptide antibody under conditions such that specific binding can occur, 
and detecting or measuring the amount of any specific binding by the 
antibody. Such binding of antibody, in tissue sections, can be used to 
detect aberrant CVSP16 polypeptide localization or aberrant {e.g. , 
30 increased, decreased or absent) levels of CVSP16 polypeptide or aberrant 
activity if CVSP16 or aberrant processing of CVSP16. For example, 
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antibody to CVSP1 6 polypeptide can be used to assay in a patient tissue or 
serum sample for the presence of CVSP1 6 polypeptide where an aberrant 
level of CVSP16 polypeptide is an indication of a diseased condition. 

The immunoassays which can be used include but are not limited to 
5 competitive and non-competitive assay systems using techniques such as 
western blots, radioimmunoassays, ELISA (enzyme linked immunosorbent 
assay), "sandwich" immunoassays, immunoprecipitation assays, precipitin 
reactions, gel diffusion precipitin reactions, immunodiffusion assays, 
agglutination assays, complement-fixation assays, immunoradiometric 

10 assays, fluorescent immunoassays and proteinalso isA immunoassays. 

CVSP1 6 polypeptide genes and related nucleic acid sequences and 
subsequences, including complementary sequences, also can be used in 
hybridization assays. CVSP16 polypeptide nucleic acid sequences, or 
subsequences thereof containing about at least 8 nucleotides, generally 14 

15 or 16 or 30 or more, generally less than 1000 or up to 100, contiguous 
nucleotides can be used as hybridization probes. Hybridization assays can 
be used to detect, prognose, diagnose, or monitor conditions, disorders, or 
disease states associated with aberrant changes in CVSP1 6 polypeptide 
expression and/or activity as described herein. In particular, such a 

20 hybridization assay is carried out by a method by contacting a sample 

containing nucleic acid with a nucleic acid probe capable of hybridizing to 
CVSP16 polypeptide encoding DNA or RNA, under conditions such that 
hybridization can occur, and detecting or measuring any resulting 
hybridization. 

25 In a specific embodiment, a method of diagnosing a disease or 

disorder characterized by detecting an aberrant level of a CVSP1 6 
polypeptide in a subject is provided herein by measuring the level of the 
DNA, RNA, protein or activity, such as protease and/or binding activity, of 
a CVSP1 6 polypeptide in a sample derived from the subject. An increase 

30 or decrease in the level of the DNA, RNA, protein or functional activity of 
the CVSP16 polypeptide, relative to the level of the DNA, RNA, protein or 
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functional activity found in an analogous sample not having the disease or 
disorder indicates the presence of the disease or disorder in the subject. 

Kits for diagnostic use are also provided, that contain in one or more 
containers an anti-CVSP1 6 polypeptide antibody, and, optionally, a labeled 
5 binding partner to the antibody. Alternatively, the anti-CVSP16 

polypeptide antibody can be labeled (with a detectable marker, e.g., a 
chemiluminescent, enzymatic, fluorescent, or radioactive moiety). A kit 
also is provided that includes in one or more containers a nucleic acid 
probe capable of hybridizing to SP protein-encoding RNA. In a specific 

10 embodiment, a kit can contain in one or more containers a pair of primers 
(e.g. , each in the size range of 6-30 nucleotides) that are capable of 
priming amplification, e.g., by polymerase chain reaction (see e.g., Innis et 
a/., 1990, PCR Protocols, Academic Press, Inc., San Diego, CA), ligase 
chain reaction (see EP 320,308), use of Q/? replicase, cyclic probe reaction, 

15 or other methods known in the art under appropriate reaction conditions of 
at least a portion of a SP protein-encoding nucleic acid. A kit can 
optionally further comprise in a container a predetermined amount of a 
purified CVSP16 polypeptide or nucleic acid, e.g., for use as a standard or 
control. 

20 K. Pharmaceutical compositions and modes of administration 
1 . Components of the compositions 

Pharmaceutical compositions containing the identified compounds 
that modulate the activity of a CVSP16 polypeptide are provided herein. 
Also provided are combinations of a compound that modulates an activity 
25 of a CVSP1 6 polypeptide and another treatment or compound for 

treatment of a neoplastic disorder, such as a chemotherapeutic compound. 

The CVSP16 polypeptide modulator and the anti-tumor agent can be 
packaged as separate compositions for administration together or 
sequentially or intermittently. Alternatively, they can provided as 
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a single composition for administration or as two compositions for 
administration as a single composition. The combinations can be packaged 
as kits. 

a. CVSP16 polypeptide inhibitors 

5 Any CVSP16 polypeptide inhibitors, including those described herein 

when used alone or in combination with other compounds, that can 
alleviate, reduce, ameliorate, prevent, or place or maintain in a state of 
remission of clinical symptoms or diagnostic markers associated with 
neoplastic diseases, including undesired and/or uncontrolled angiogenesis, 

10 can be used in the present combinations. 

For example, the CVSP16 polypeptide inhibitor is an antibody or 
fragment thereof that specifically reacts with a CVSP16 polypeptide or the 
protease domain thereof or other region thereof, such as the activation 
region, or is an inhibitor of the CVSP16 polypeptide production, an inhibitor 

15 of CVSP16 polypeptide membrane-localization or an inhibitor of the 
expression or activation of a CVSP16 polypeptide. 

b. Anti-angiogenic agents and anti-tumor agents 

Any anti-angiogenic agents and anti-tumor agents, including those 
described herein, when used alone or in combination with other 

20 compounds, that can alleviate, reduce, ameliorate, prevent, or place or 

maintain in a state of remission of clinical symptoms or diagnostic markers 
associated with undesired and/or uncontrolled angiogenesis and/or tumor 
growth and metastasis, particularly solid neoplasms, vascular 
malformations and cardiovascular disorders, chronic inflammatory diseases 

25 and aberrant wound repairs, circulatory disorders, crest syndromes, 
dermatological disorders, or ocular disorders, can be used in the 
combinations. Also contemplated are anti-tumor agents for use in 
combination with an inhibitor of a CVSP16 polypeptide. 
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c. Anti-tumor agents and anti-angiogenic agents 

The compounds identified by the methods provided herein or 
provided herein can be used in combination with anti-tumor agents and/or 
anti-angiogenesis agents. 
5 2. Formulations and route of administration 

The compounds herein and agents can be formulated as pharmaceut- 
ical compositions, typically for single dosage administration. The 
concentrations of the compounds in the formulations are effective for 
delivery of an amount, upon administration, that is effective for the 

10 intended treatment. Typically, the compositions are formulated for single 
dosage administration. To formulate a composition, the weight fraction of 
a compound or mixture thereof is dissolved, suspended, dispersed or 
otherwise mixed in a selected vehicle at an effective concentration such 
that the treated condition is relieved or ameliorated. Pharmaceutical 

15 carriers or vehicles suitable for administration of the compounds provided 
herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 

In addition, the compounds can be formulated as the sole 
pharmaceutically active ingredient in the composition or can be combined 

20 with other active ingredients. Liposomal suspensions, including tissue- 
targeted liposomes, also can be suitable as pharmaceutically acceptable 
carriers. These can be prepared according to methods known to those 
skilled in the art. For example, liposome formulations can be prepared as 
described in U.S. Patent No. 4,522,81 1 . 

25 The active compound is included in the pharmaceutically acceptable 

carrier in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. The 
therapeutically effective concentration can be determined empirically by 
testing the compounds in known in vivo and in vivo systems, such as the 

30 assays provided herein. 
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The concentration of active compound in the drug composition 
depends on absorption, inactivation and excretion rates of the active 
compound, the physicochemical characteristics of the compound, the 
dosage schedule, and amount administered as well as other factors known 
5 to those of skill in the art. 

Typically a therapeutically effective dosage is contemplated. The 
amounts administered can be on the order of 0.001 to 1 mg/ml, including 
about 0.005-0.05 mg/ml and about 0.01 mg/ml, of blood volume. Pharma- 
ceutical dosage unit forms are prepared to provide from about 1also ismg 

10 to about 1000 mg, including from about 10 to about 500 mg, and 
including about 25-75 mg of the essential active ingredient or a 
combination of essential ingredients per dosage unit form. The precise 
dosage can be empirically determined. 

The active ingredient can be administered at once, or can be divided 

15 into a number of smaller doses to be administered at intervals of time, It is 
understood that the precise dosage and duration of treatment is a function 
of the disease being treated and can be determined empirically using 
known testing protocols or by extrapolation from in vivo or in vitro test 
data. It is to be noted that concentrations and dosage values also can vary 

20 with the severity of the condition to be alleviated. It is to be further 

understood that for any particular subject, specific dosage regimens should 
be adjusted over time according to the individual need and the professional 
judgment of the person administering or supervising the administration of 
the compositions, and that the concentration ranges set forth herein are 

25 exemplary only and are not intended to limit the scope or use of the 
claimed compositions and combinations containing them. 

Pharmaceutically acceptable derivatives include acids, salts, esters, 
hydrates, solvates and prodrug forms. The derivative is typically selected 
such that its pharmacokinetic properties are superior to the corresponding 

30 neutral compound. 
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Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutical^ acceptable derivatives 
thereof are mixed with a suitable pharmaceutical carrier or vehicle for 
systemic, topical or local administration to form pharmaceutical 
5 compositions. Compounds are included in an amount effective for 

ameliorating or treating the disorder for which treatment is contemplated. 
The concentration of active compound in the composition depends on 
absorption, inactivation, excretion rates of the active compound, the 
dosage schedule, amount administered, particular formulation as well as 

10 other factors known to those of skill in the art. 

Solutions or suspensions used for parenteral, intradermal, 
subcutaneous, or topical application can include any of the following 
components: a sterile diluent, such as water for injection, saline solution, 
fixed oil, polyethylene glycol, glycerine, propylene glycol or other synthetic 

15 solvent; antimicrobial agents, such as benzyl alcohol and methyl parabens; 
antioxidants, such as ascorbic acid and sodium bisulfite; chelating agents, 
such as ethylenediaminetetraacetic acid (EDTA); buffers, such as acetates, 
citrates and phosphates; and agents for the adjustment of tonicity such as 
sodium chloride or dextrose. Parenteral preparations can be enclosed in 

20 ampules, disposable syringes or single or multiple dose vials made of glass, 
plastic or other suitable material. 

In instances in which the compounds exhibit insufficient solubility, 
methods for solubilizing compounds can be used. Such methods are 
known to those of skill in this art, and include, but are not limited to, using 

25 cosolvents, such as dimethylsulfoxide (DMSO), using surfactants, such as 
Tween®, or dissolution in aqueous sodium bicarbonate. Derivatives of the 
compounds, such as prodrugs of the compounds also can be used in 
formulating effective pharmaceutical compositions. For ophthalmic 
indications, the compositions are formulated in an ophthalmically 

30 acceptable carrier. For the ophthalmic uses herein, local administration, 
either by topical administration or by injection are contemplated. Time 
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release formulations are also desirable. Typically, the compositions are 
formulated for single dosage administration, so that a single dose 
administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the 
5 resulting mixture can be a solution, suspension, emulsion or other 

composition. The form of the resulting mixture depends upon a number of 
factors, including the intended mode of administration and the solubility of 
the compound in the selected carrier or vehicle. If necessary, pharmaceuti- 
cal^ acceptable salts or other derivatives of the compounds are prepared. 

10 The compound is included in the pharmaceutical^ acceptable carrier 

in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. It is understood 
that number and degree of side effects depends upon the condition for 
which the compounds are administered. For example, certain toxic and 

15 undesirable side effects are tolerated when treating life-threatening 
illnesses that would not be tolerated when treating disorders of lesser 
consequence. 

The compounds also can be mixed with other active materials, that 
do not impair the desired action, or with materials that supplement the 

20 desired action known to those of skill in the art. The formulations of the 
compounds and agents for use herein include those suitable for oral, rectal, 
topical, inhalational, buccal (e.g., sublingual), parenteral (e.g., 
subcutaneous, intramuscular, intradermal, or intravenous), transdermal 
administration or any route. The most suitable route in any given case 

25 depends on the nature and severity of the condition being treated and on 
the nature of the particular active compound which is being used. The 
formulations are provided for administration to humans and animals in unit 
dosage forms, such as tablets, capsules, pills, powders, granules, sterile 
parenteral solutions or suspensions, and oral solutions or suspensions, and 

30 oil-water emulsions containing suitable quantities of the compounds or 
pharmaceutically acceptable derivatives thereof. The pharmaceutical^ 
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therapeutically active compounds and derivatives thereof are typically 
formulated and administered in unit-dosage forms or multiple-dosage forms. 
Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in 
5 the art. Each unit-dose contains a predetermined quantity of the 
therapeutically active compound sufficient to produce the desired 
therapeutic effect, in association with the required pharmaceutical carrier, 
vehicle or diluent. Examples of unit-dose forms include ampoules and 
syringes and individually packaged tablets or capsules. Unit-dose forms 

10 can be administered in fractions or multiples thereof. A multiple-dose form 
is a plurality of identical unit-dosage forms packaged in a single container 
to be administered in segregated unit-dose form. Examples of 
multiple-dose forms include vials, bottles of tablets or capsules or bottles 
of pints or gallons. Hence, multiple dose form is a multiple of unit-doses 

15 which are not segregated in packaging. 

The composition can contain along with the active ingredient: a dilu- 
ent such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellu- 
lose; a lubricant, such as magnesium stearate, calcium stearate and talc; 
and a binder such as starch, natural gums, such as gum acacia, gelatin, 

20 glucose, molasses, polyvinylpyrrolidine, celluloses and derivatives thereof, 
povidone, crospovidones and other such binders known to those of skill in 
the art. Liquid pharmaceutical^ administrable compositions can, for 
example, be prepared by dissolving, dispersing, or otherwise mixing an 
active compound as defined above and optional pharmaceutical adjuvants 

25 in a carrier, such as, for example, water, saline, aqueous dextrose, 
glycerol, glycols, ethanol, and the like, to thereby form a solution or 
suspension. If desired, the pharmaceutical composition to be administered 
also can contain minor amounts of nontoxic auxiliary substances such as 
wetting agents, emulsifying agents, or solubilizing agents, pH buffering 

30 agents and the like, for example, acetate, sodium citrate, cyclodextrine 
derivatives, sorbitan monolaurate, triethanolamine sodium acetate, 
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triethanolamine oleate, and other such agents. Methods of preparing such 
dosage forms are known, or will be apparent, to those skilled in this art 
(see, e.g. , Remington's Pharmaceutical Sciences, Mack Publishing 
Company, Easton, Pa., 15th Edition, 1975). The composition or 
5 formulation to be administered contains a quantity of the active compound 
in an amount sufficient to alleviate the symptoms of the treated subject. 

Dosage forms or compositions containing active ingredient in the 
range of 0.005% to 100% with the balance made up from non-toxic carrier 
can be prepared. For oral administration, the pharmaceutical compositions 

10 can take the form of, for example, tablets or capsules prepared by 

conventional means with pharmaceutical^ acceptable excipients such as 
binding agents (e.g., pregelatinized maize starch, polyvinyl pyrrolidone or 
hydroxypropyl methylcellulose); fillers {e.g., lactose, microcrystalline 
cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium 

15 stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets 
can be coated by methods well-known in the art. 

The pharmaceutical preparation also can be in liquid form, for 
example, solutions, syrups or suspensions, or can be presented as a drug 

20 product for reconstitution with water or other suitable vehicle before use. 
Such liquid preparations can be prepared by conventional means with 
pharmaceutical^ acceptable additives such as suspending agents (e.g. , 
sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., 

25 almond oil, oily esters, or fractionated vegetable oils); and preservatives 
(e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). 

Formulations suitable for rectal administration can be presented as 
unit dose suppositories. These can be prepared by admixing the active 
compound with one or more conventional solid carriers, for example, cocoa 

30 butter, and then shaping the resulting mixture. 
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Formulations suitable for topical application to the skin or to the eye 
generally are formulated as an ointment, cream, lotion, paste, gel, spray, 
aerosol and oil. Carriers which can be used include vaseline, lanoline, 
polyethylene glycols, alcohols, and combinations of two or more thereof. 
5 The topical formulations can further advantageously contain 0.05 to 1 5 
percent by weight of thickeners selected from among hydroxypropyl 
methyl cellulose, methyl cellulose, polyvinylpyrrolidone, polyvinyl alcohol, 
poly (alkylene glycols), poly/hydroxyalkyl, (meth)acrylates or 
poly(meth)acrylamides. A topical formulation is often applied by instillation 

10 or as an ointment into the conjunctival sac. It also can be used for 

irrigation or lubrication of the eye, facial sinuses, and external auditory 
meatus. It also can be injected into the anterior eye chamber and other 
places. The topical formulations in the liquid state can be also present in a 
hydrophilic three-dimensional polymer matrix in the form of a strip, contact 

15 lens, and the like from which the active components are released. 

For administration by inhalation, the compounds for use herein can 
be delivered in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, e.g., dichlorodi- 
fluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 

20 dioxide or other suitable gas. In the case of a pressurized aerosol, the 
dosage unit can be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or 
insufflator can be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

25 Formulations suitable for buccal (sublingual) administration include, 

for example, lozenges containing the active compound in a flavored base, 
usually sucrose and acacia or tragacanth; and pastilles containing the 
compound in an inert base such as gelatin and glycerin or sucrose and 
acacia. 

30 The compounds can be formulated for parenteral administration by 

injection, e.g., by bolus injection or continuous infusion. Formulations for 
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injection can be presented in unit dosage form, e.g. , in ampules or in multi- 
dose containers, with an added preservative. The compositions can be 
suspensions, solutions or emulsions in oily or aqueous vehicles, and can 
contain formulatory agents such as suspending, stabilizing and/or 
5 dispersing agents. Alternatively, the active ingredient can be in powder 
form for reconstitution with a suitable vehicle, e.g., sterile pyrogen-free 
water or other solvents, before use. 

Formulations suitable for transdermal administration can be 
presented as discrete patches adapted to remain in intimate contact with 

10 the epidermis of the recipient for a prolonged period of time. Such patches 
suitably contain the active compound as an optionally buffered aqueous 
solution of, for example, 0.1 to 0.2 M concentration with respect to the 
active compound. Formulations suitable for transdermal administration also 
can be delivered by iontophoresis (see, e.g., Pharmaceutical Research 3 

15 (6), 318 (1986)) and typically take the form of an optionally buffered 
aqueous solution of the active compound. 

The pharmaceutical compositions also can be administered by 
controlled release means and/or delivery devices (see, e.g., in U.S. Patent 
Nos. 3,536,809; 3,598,123; 3,630,200; 3,845,770; 3,847,770; 

20 3,916,899; 4,008,719; 4,687,610; 4,769,027; 5,059,595; 5,073,543; 
5,120,548; 5,354,566; 5,591,767; 5,639,476; 5,674,533 and 
5,733,566). 

Desirable blood levels can be maintained by a continuous infusion of 
the active agent as ascertained by plasma levels. It should be noted that 

25 the attending physician would know how to and when to terminate, 
interrupt or adjust therapy to lower dosage due to toxicity, or bone 
marrow, liver or kidney dysfunctions. Conversely, the attending physician 
would also know how to and when to adjust treatment to higher levels if 
the clinical response is not adequate (precluding toxic side effects). 

30 The efficacy and/or toxicity of the CVSP16 polypeptide inhibitor(s), 

alone or in combination with other agents also can be assessed by the 
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methods known in the art (see, e.g., O'Reilly, Investigational New Drugs 
75:5-13 (1997)). 

The active compounds or pharmaceutical^ acceptable derivatives 
can be prepared with carriers that protect the compound against rapid 
5 elimination from the body, such as time release formulations or coatings. 

Kits containing the compositions and/or the combinations with 
instructions for administration thereof are provided. The kit can further 
include a needle or syringe, typically packaged in sterile form, for injecting 
the complex, and/or a packaged alcohol pad. Instructions are optionally 
10 included for administration of the active agent by a clinician or by the 
patient. 

Finally, the compounds or CVSP16 polypeptides or protease domains 
thereof or compositions containing any of the preceding agents can be 
packaged as articles of manufacture containing packaging material, a 

15 compound or suitable derivative thereof provided herein, which is effective 
for treatment of diseases or disorders contemplated herein, within the 
packaging material, and a label that indicates that the compound or a 
suitable derivative thereof is for treating the diseases or disorders 
contemplated herein. The label can optionally include the disorders for 

20 which the therapy is warranted. 
L. Methods of treatment 

The compounds identified by the methods herein are used for 
treating or preventing neoplastic diseases in an animal, particularly a 
mammal, including a human, and are provided herein. In one embodiment, 

25 the method includes administering to a mammal an effective amount of an 
inhibitor of a CVSP16 polypeptide, whereby the disease or disorder is 
treated or prevented. 

In an embodiment, the CVSP16 polypeptide inhibitor used in the 
treatment or prevention is administered with a pharmaceutical^ acceptable 

30 carrier or excipient. The mammal treated can be a human. The inhibitors 
provided herein are those identified by the screening assays. In addition, 
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antibodies and antisense nucleic acids or double-stranded RNA (dsRNA), 
such as RNAi, are contemplated. 

The treatment or prevention method can further include adminis- 
tering an anti-angiogenic treatment or agent or anti-tumor agent 
5 simultaneously with, prior to or subsequent to the CVSP16 polypeptide 
inhibitor, which can be any compound identified that inhibits the activity of 
a CVSP16 polypeptide. Such compounds include small molecule 
modulators, a natural product or derivative thereof, an antibody or a 
fragment or derivative thereof containing a binding region thereof against 

10 the CVSP16 polypeptide, an antisense nucleic acid or double-stranded RNA 
(dsRNA), such as RNAi, encoding a portion of the CVSP16 polypeptide (or 
the complement thereof), and a nucleic acid containing at least a portion of 
a gene encoding the CVSP16 polypeptide into which a heterologous 
nucleotide sequence has been inserted such that the heterologous 

15 sequence inactivates the biological activity of at least a portion of the gene 
encoding the CVSP16 polypeptide, in which the portion of the gene 
encoding a CVSP16 polypeptide flanks the heterologous sequence to 
promote homologous recombination with a genomic gene (or endogenous 
gene) encoding a CVSP16 polypeptide. In addition, such molecules are 

20 generally less than about 1000 nt long. 
1 . Antisense treatment 

In a specific embodiment, as described hereinabove, CVSP16 
polypeptide function is reduced or inhibited by CVSP16 polypeptide 
antisense nucleic acids, to treat or prevent neoplastic disease. The 

25 therapeutic or prophylactic use of nucleic acids of at least six nucleotides 
that are antisense to a gene or cDNA encoding CVSP16 polypeptide or a 
portion thereof. A CVSP16 polypeptide "antisense" nucleic acid as used 
herein refers to a nucleic acid capable of hybridizing to a portion of a 
CVSP16 polypeptide RNA (generally mRNA) by virtue of some sequence 

30 complementarity, and generally under high stringency conditions. The 

antisense nucleic acid can be complementary to a coding and/or noncoding 
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region of a CVSP16 polypeptide mRNA. Such antisense nucleic acids have 
utility as therapeutics that reduce or inhibit CVSP16 polypeptide function, 
and can be used in the treatment or prevention of disorders as described 
supra. 

5 The CVSP16 polypeptide antisense nucleic acids are of at least six 

nucleotides and are generally oligonucleotides (ranging from 6 to about 
150 nucleotides including 6 to 50 nucleotides). The antisense molecule 
can be complementary to all or a portion of the protease domain. For 
example, the oligonucleotide is at least 10 nucleotides, at least 15 

10 nucleotides, at least 100 nucleotides, or at least 125 nucleotides. The 

oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety and/or 
phosphate backbone. The oligonucleotide can include other appending 

15 groups such as peptides, or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., Proc. Natl. Acad. Sci. U.S.A. 
56:6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. U.S.A. 
54:648-652 (1987); PCT Publication No. WO 88/09810, published 
December 15, 1988) or blood-brain barrier (see, e.g., PCT Publication No. 

20 WO 89/10134, published April 25, 1988), hybridization-triggered cleavage 
agents (see, e.g., Krol et al., BioTechniques 6:958-976 (1988)) or 
intercalating agents (see, e.g., Zon, Pharm. Res. 5:539-549 (1988)). 

The CVSP16 polypeptide antisense nucleic acid generally is an oligo- 
nucleotide, typically single-stranded DNA or RNA or an analog thereof or 

25 mixtures thereof. For example, the oligonucleotide includes a sequence 

antisense to a portion of a huma CVSP16 polypeptide. The oligonucleotide 
can be modified at any position on its structure with substituents generally 
known in the art. 

The CVSP16 polypeptide antisense oligonucleotide can include at 

30 least one modified base moiety which is selected from the group including, 
but not limited to 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
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hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) 
uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethyl- 
aminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2,2-dimethyl- 
5 guanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methyl- 
cytosine, N6-adenine, 7-methylguanine, 5-methytaminomethyluracil, 
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxy- 
carboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, 
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

10 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 

5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid 
(v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, 
and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide includes at least one 

15 modified sugar moiety selected from the group including but not limited to 
arabinose, 2-fluoroarabinose, xylulose, and hexose. The oligonucleotide 
can include at least one modified phosphate backbone selected from a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 

20 phosphotriester, and a formacetal or analog thereof. 

The oligonucleotide can be an a-anomeric oligonucleotide. An a- 
anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which the strands run parallel to each other 
(Gautier et a/., Nucl. Acids Res. 75:6625-6641 (1987)). 

25 The oligonucleotide can be conjugated to another molecule, e.g., a 

peptide, hybridization triggered cross-linking agent, transport agent and 
hybridization-triggered cleavage agent. 

The oligonucleotides can be synthesized by standard methods 
known in the art, e.g. by use of an automated DNA synthesizer (such as 

30 are commercially available from Biosearch, Applied Biosystems, and other 
sources). As examples, phosphorothioate oligonucleotides can be 
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synthesized by the method of Stein et al. (Nucl. Acids Res. 16:3209 
(1988)), methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin et a/., Proc. Natl. Acad. Sci. 
U.S.A. 35:7448-7451 (1988)), and others. 
5 In a specific embodiment, the CVSP16 polypeptide antisense 

oligonucleotide includes catalytic RNA or a ribozyme (see, e.g., PCT 
International Publication WO 90/11364, published October 4, 1990; Sarver 
et al., Science 247:1222-1225 (1990)). In another embodiment, the 
oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., Nucl. Acids 

10 Res. 75:6131-6148 (1987)), or a chimeric RNA-DNA analogue (Inoue et 
al., FEBS Lett. 2 75:327-330 (1987)). Alternatively, the oligonucleotide 
can be double-stranded RNA (dsRNA) such as RNAi. 

In an alternative embodiment, the CVSP1 6 polypeptide antisense 
nucleic acid is produced intracellular^ by transcription from an exogenous 

15 sequence. For example, a vector can be introduced in vivo such that it is 
taken up by a cell, within which cell the vector or a portion thereof is 
transcribed, producing an antisense nucleic acid (RNA). Such a vector 
would contain a sequence encoding the CVSP16 polypeptide antisense 
nucleic acid. Such a vector can remain episomal or become 

20 chromosomally integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by recombinant 
DNA technology methods standard in the art. Vectors can be plasmid, 
viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the CVSP16 

25 polypeptide antisense RNA can be by any promoter known in the art to act 
in mammalian, including human, cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: the SV40 early 
promoter region (Bernoist and Chambon, Nature 250:304-310 (1981), the 
promoter contained in the 3' long terminal repeat of Rous sarcoma virus 

30 (Yamamoto et al., Cell 22:787-797 (1980), the herpes thymidine kinase 
promoter (Wagner et a/., Proc. Natl. Acad. Sci. U.S.A. 78A 441 -1445 
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(1981), the regulatory sequences of the metallothionein gene (Brinster et 
at., Nature 296:39-42 (1982)). 

The antisense nucleic acids include sequence complementary to at 
least a portion of an RNA transcript of a CVSP1 6 polypeptide gene, 
5 including a huma CVSP16 polypeptide gene. Absolute complementarily is 
not required. 

The amount of CVSP16 polypeptide antisense nucleic acid that is 
effective in the treatment or prevention of neoplastic disease depends on 
the nature of the disease, and can be determined empirically by standard 
10 clinical techniques. Where possible, it is desirable to determine the 

antisense cytotoxicity in cells in vitro, and then in useful animal model 
systems prior to testing and use in humans. 
2. RNA interference 

RNA interference (RNAi) (see, e.g. Chuang et al. (2000) Proc. Natl. 

15 Acad. Sci. U.S.A. 57:4985) can be employed to inhibit the expression of a 
gene encoding a CVSP16. Interfering RNA (RNAi) fragments, particularly 
double-stranded (ds) RNAi, can be used to generate loss-of-CVSP1 6 
function. Methods relating to the use of RNAi to silence genes in 
organisms including, mammals, C. elegans, Drosophila and plants, and 

20 humans are known (see, e.g., Fire et al. (1998) Nature 35 7:806-81 1 Fire 
(1999) Trends Genet. 75:358-363; Sharp (2001) Genes Dev. 75:485-490; 
Hammond, et al. (2001) Nature Rev. Genet.2-A10-'\'\'\2', Tuschl (2001) 
Chem. Biochem. 2:239-245; Hamilton et al. (1999) Science 286:950-952; 
Hammond et aL (2000) Nature 404:293-296; Zamore et al. (2000) Cell 

25 707:25-33; Bernstein et al. (2001) Nature 409: 363-366; Elbashir et al. 
(2001) Genes Dev. 75:188-200; Elbashir et al. (2001) Nature 47 7:494- 
498; International PCT application No. WO 01/29058; International PCT 
application No. WO 99/32619). By selecting appropriate sequences, 
expression of dsRNA can interfere with accumulation of endogenous 

30 mRNA encoding a CVSP16. 
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Double-stranded RNA (dsRNA)-expressing constructs are introduced 
into a host, such as an animal or plant. This can be accomplished by any 
of numerous methods known in the art, for example by including it in a 
replicable vector, such as a viral vector (see discussion below), that 
5 remains episomal or integrates into the genome. The dsRNA can be 
introduced into an appropriate nucleic acid expression vector and 
administering it so that it becomes intracellular, e.g., by infection using a 
defective or attenuated retroviral or other viral vector (see U.S. Patent No. 
4,980,286). Other methods include, but are not limited to, direct injection 

10 of naked DNA, using microparticle bombardment (e.g., a gene gun; 
Biolistic, Dupont), coating with lipids or cell-surface receptors or 
transfecting agents, encapsulation in liposomes, microparticles, or 
microcapsules, administering it in linkage to a peptide which is known to 
enter the nucleus, administering it in linkage to a ligand subject to receptor- 

15 mediated endocytosis (see e.g., Wu and Wu, J. Biol. Chem. 262:4429- 
4432 (1987)) (which can be used to target cell types specifically 
expressing the receptors) and other methods. In other methods, a nucleic 
acid-ligand complex can be formed in which the ligand is a fusogenic viral 
peptide to disrupt endosomes, allowing the nucleic acid to avoid lysosomal 

20 degradation. In other methods, the nucleic acid can be targeted in vivo for 
cell specific uptake and expression, by targeting a specific receptor (see, 
e.g., Published International PCT application Nos. WO 92/06180, dated 
April 16, 1992 (Wu et al.); WO 92/22635, dated December 23, 1992 
(Wilson et al.); WO92/20316, dated November 26, 1992 (Findeis et al.); 

25 W093/14188, dated July 22, 1993 (Clarke eta/.), WO 93/20221, dated 
October 14, 1993 (Young)). Alternatively, the nucleic acid can be 
introduced intracellular^ and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and Smithies, Proc. Natl. 
Acad. Sci. USA 56:8932-8935 (1989); Zijlstra eta/., Nature 342:435-438 

30 (1989)). 
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RNAi can be used to inhibit expression in vitro or in vivo. Regions 
include at least about 21 (or 21) nucleotides that are selective (i.e. unique) 
for CVSP16 are used to prepare the RNAi. Smaller fragments of about 21 
nucleotides can be transformed directly (i.e., in vitro or in vivo) into cells; 
5 larger RNAi dsRNA molecules are generally introduced using vectors that 
encode them. dsRNA molecules are at least about 21 bp long or longer, 
such as 50, 100, 150, 200 and longer. Methods, reagents and protocols 
for introducing nucleic acid molecules in to cells in vitro and in vivo are 
known to those of skill in the art. 

10 3. Gene Therapy 

In an exemplary embodiment, nucleic acids that include a sequence 
of nucleotides encoding a CVSP16 polypeptide or functional domains or 
derivative thereof, are administered to promote CVSP1 6 polypeptide 
function, by way of gene therapy. In this embodiment, the nucleic acid 

15 produces an encoded protein (or the nucleic acid or encoded RNA) that 
mediates a therapeutic effect by promoting CVSP16 polypeptide function. 
Any of the methods for gene therapy available in the art can be used (see, 
Goldspiel et al., Clinical Pharmacy 72:488-505 (1993); Wu and Wu, 
Biotherapy 3:87-95 (1991); Tolstoshev, An. Rev. Pharmacol. Toxicol. 

20 32:573-596 (1993); Mulligan, Science 260:926-932 (1993); and Morgan 
and Anderson, An. Rev. Biochem. 62:191-217 (1993); TIBTECH 
/7f5,/:155-215 (1993). 

For example, one therapeutic composition for gene therapy includes 
a CVSP16 polypeptide-encoding nucleic acid that is part of an expression 

25 vector that expresses a CVSP16 polypeptide or domain, fragment or 
chimeric protein thereof in a suitable host. In particular, such a nucleic 
acid has a promoter operably linked to the CVSP16 polypeptide coding 
region, the promoter being inducible or constitutive, and, optionally, tissue- 
specific. In another particular embodiment, a nucleic acid molecule is used 

30 in which the CVSP1 6 polypeptide coding sequences and any other desired 
sequences are flanked by regions that promote homologous recombination 
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at a desired site in the genome, thus providing for intrachromosomal 
expression of the SP protein nucleic acid (Koller and Smithies, Proc. Natl. 
Acad. Sci. USA 55:8932-8935 (1989); Zijlstra et aL, Nature 342:435-438 
(1989)). 

5 Delivery of the nucleic acid into a patient can be either direct, in 

which case the patient is directly exposed to the nucleic acid or nucleic 
acid-carrying vector, or indirect, in which case, cells are first transformed 
with the nucleic acid in vitro, then transplanted into the patient. These 
two approaches are known, respectively, as in vivo or ex vivo gene 
10 therapy. 

In a specific embodiment, the nucleic acid is directly administered in 
vivo, and it is expressed to produce the encoded product. This can be 
accomplished by any of numerous methods known in the art, e.g., by 
constructing it as part of an appropriate nucleic acid expression vector and 

15 administering it so that it becomes intracellular, e.g., by infection using a 
defective or attenuated retroviral or other viral vector (see U.S. Patent No. 
4,980,286), or by direct injection of naked DNA, or by use of microparticle 
bombardment (e.g., a gene gun; Biolistic, Dupont), or coating with lipids or 
cell-surface receptors or transfecting agents, encapsulation in liposomes, 

20 microparticles, or microcapsules, or by administering it in linkage to a 

peptide which is known to enter the nucleus, by administering it in linkage 
to a ligand subject to receptor-mediated endocytosis (see e.g., Wu and 
Wu, J. Biol. Chem. 262:4429-4432 (1987)) (which can be used to target 
cell types specifically expressing the receptors). In another embodiment, a 

25 nucleic acid-ligand complex can be formed in which the ligand is a 

fusogenic viral peptide to disrupt endosomes, allowing the nucleic acid to 
avoid lysosomal degradation. In yet another embodiment, the nucleic acid 
can be targeted in vivo for cell specific uptake and expression, by targeting 
a specific receptor (see, e.g., Published International PCT application Nos. 

30 WO 92/06180 dated April 16, 1992 (Wu eta/.); WO 92/22635 dated 
December 23, 1992 (Wilson eta/.); W092/20316, November 26, 1992 
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(Findeis et al.), W093/1 41 88, July 22, 1993 (Clarke etal.), WO 
93/20221, October 14, 1993 (Young)). Alternatively, the nucleic acid can 
be introduced intracellularly and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and Smithies, Proc. Natl. 
5 Acad. Sci. USA 86:8932-8935 (1989); Zijlstra et al., Nature 542:435-438 
(1989)). 

In a specific embodiment, a viral vector that contains the CVSP1 6 
polypeptide nucleic acid is used. For example, a retroviral vector can be 
used (see Miller et al., Meth. Enzymol. 277:581-599 (1993)). These 

10 retroviral vectors have been modified to delete retroviral sequences that are 
not necessary for packaging of the viral genome and integration into host 
cell DNA. The CVSP16 polypeptide nucleic acid to be used in gene 
therapy is cloned into the vector, which facilitates delivery of the gene into 
a patient. More detail about retroviral vectors can be found in Boesen et 

15 al., Biotherapy 6:291-302 (1994), which describes the use of a retroviral 
vector to deliver the mdrl gene to hematopoietic stem cells in order to 
make the stem cells more resistant to chemotherapy. Other references 
illustrating the use of retroviral vectors in gene therapy are: Clowes et al., 
J. Clin. Invest. 53:644-651 (1994); Kiem etal., Blood 83:1467-1473 

20 (1994); Salmons and Gunzberg, Human Gene Therapy 4'A 29-141 (1993); 
and Grossman and Wilson, Curr. Opin. in Genetics and Devel. 3:1 10-1 14 
(1993). 

Adenoviruses are other viral vectors that can be used in gene 
therapy. Adenoviruses are especially attractive vehicles for delivering 

25 genes to respiratory epithelia. Adenoviruses naturally infect respiratory 
epithelia where they cause a mild disease. Other targets for adenovirus- 
based delivery systems are liver, the central nervous system, endothelial 
cells, and muscle. Adenoviruses have the advantage of being capable of 
infecting non-dividing cells. Kozarsky and Wilson, Current Opinion in 

30 Genetics and Development 3:499-503 (1 993) present a review of 

adenovirus-based gene therapy. Bout et al., Human Gene Therapy 5:3-10 
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(1994) demonstrated the use of adenovirus vectors to transfer genes to 
the respiratory epithelia of rhesus monkeys. Other instances of the use of 
adenoviruses in gene therapy can be found in Rosenfeld et al., Science 
252:431-434 (1991); Rosenfeld etal.., Cell 68A 43-1 55 (1992); and 
5 Mastrangeli et a/., J. Clin. Invest 37:225-234 (1993). Adeno-associated 
virus (AAV) also is used in gene therapy (Walsh et al. t Proc. Soc. Exp. Biol. 
Med. 204:289-300 (1993). 

Another approach to gene therapy involves transferring a gene to 
cells in tissue culture by such methods as electroporation, lipofection, 

10 calcium phosphate mediated transfection, or viral infection. Usually, the 
method of transfer includes the transfer of a selectable marker to the cells. 
The cells are then placed under selection to isolate those cells that have 
taken up and are expressing the transferred gene. Those cells are then 
delivered to a patient. 

15 In this embodiment, the nucleic acid is introduced into a cell prior to 

administration in vivo of the resulting recombinant cell. Such introduction 
can be carried out by any method known in the art, including but not 
limited to transfection, electroporation, microinjection, infection with a viral 
or bacteriophage vector containing the nucleic acid sequences, cell fusion, 

20 chromosome-mediated gene transfer, microcell-mediated gene transfer, 
spheroplast fusion and other delivery methods. Numerous techniques are 
known in the art for the introduction of foreign genes into cells (see e.g. , 
Loeffler and Behr, Meth. Enzymol. 27 7:599-618 (1993); Cohenef a/., 
Meth. Enzymol. 217:618-644 (1993); Cline, Pharmac. Ther. 23:69-92 

25 (1985)) and can be used, provided that the necessary developmental and 
physiological functions of the recipient cells are not disrupted. The 
technique should provide for the stable transfer of the nucleic acid to the 
cell, so that the nucleic acid is expressible by the cell and generally 
heritable and expressible by its cell progeny. 

30 The resulting recombinant cells can be delivered to a patient by 

various methods known in the art. In an embodiment, epithelial cells are 
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injected, e.g., subcutaneously. In another embodiment, recombinant skin 
cells can be applied as a skin graft onto the patient. Recombinant blood 
cells {e.g., hematopoietic stem or progenitor cells) can be administered 
intravenously. The amount of cells envisioned for use depends on the 
5 desired effect, patient state and other parameters, and can be determined 
by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of 
gene therapy encompass any desired, available cell type, and include but 
are not limited to epithelial cells, endothelial cells, keratinocytes, 

10 fibroblasts, muscle cells, hepatocytes; blood cells such as Talso 
islymphocytes, Balso islymphocytes, monocytes, macrophages, 
neutrophils, eosinophils, megakaryocytes, granulocytes; various stem or 
progenitor cells, in particular hematopoietic stem or progenitor cells, e.g., 
such as stem cells obtained from bone marrow, umbilical cord blood, 

15 peripheral blood, fetal liver, and other sources thereof. 

For example, a cell used for gene therapy is autologous to the 
patient. In an embodiment in which recombinant cells are used in gene 
therapy, a CVSP1 6 polypeptide nucleic acid is introduced into the cells 
such that it is expressible by the cells or their progeny, and the 

20 recombinant cells are then administered in vivo for therapeutic effect. In a 
specific embodiment, stem or progenitor cells are used. Any stem and/or 
progenitor cells which can be isolated and maintained in vitro can 
potentially be used in accordance with this embodiment. Such stem cells 
include but are not limited to hematopoietic stem cells (HSC), stem cells of 

25 epithelial tissues such as the skin and the lining of the gut, embryonic heart 
muscle cells, liver stem cells (PCT Publication WO 94/08598, dated April 
28, 1994), and neural stem cells (Stemple and Anderson, Cell 77:973-985 
(1992)). 

Epithelial stem cells (ESCs) or keratinocytes can be obtained from 
30 tissues such as the skin and the lining of the gut by known procedures 

(Rheinwald, Meth. Cell Bio. 21A.229 (1980)). In stratified epithelial tissue 
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such as the skin, renewal occurs by mitosis of stem cells within the 
germinal layer, the layer closest to the basal lamina. Stem cells within the 
lining of the gut provide for a rapid renewal rate of this tissue. ESCs or 
keratinocytes obtained from the skin or lining of the gut of a patient or 
5 donor can be grown in tissue culture (Rheinwald, Meth. Cell Bio. 2 7/4:229 
(1980); Pittelkow and Scott, Cano Clinic Proc. 67:771 (1986)). If the 
ESCs are provided by a donor, a method for suppression of host versus 
graft reactivity (e.g. , irradiation, drug or antibody administration to promote 
moderate immunosuppression) also can be used. 

10 With respect to hematopoietic stem cells (HSC), any technique 

which provides for the isolation, propagation, and maintenance in vitro of 
HSC can be used in this embodiment. Techniques by which this can be 
accomplished include (a) the isolation and establishment of HSC cultures 
from bone marrow cells isolated from the future host, or a donor, or (b) the 

15 use of previously established long-term HSC cultures, which can be 

allogeneic or xenogeneic. Non-autologous HSC generally are used with a 
method of suppressing transplantation immune reactions of the future 
host/patient. In a particular embodiment, human bone marrow cells can be 
obtained from the posterior iliac crest by needle aspiration (see, e.g., Kodo 

20 et a/., J. Clin. Invest. 73:1377-1384 (1984)). For example, the HSCs can 
be made highly enriched or in substantially pure form. This enrichment can 
be accomplished before, during, or after long-term culturing, and can be 
done by any techniques known in the art. Long-term cultures of bone 
marrow cells can be established and maintained by using, for example, 

25 modified Dexter cell culture techniques (Dexter eta/., J. Cell Physiol. 
5 7:335 (1977) or Witlock-Witte culture techniques (Witlock and Witte, 
Proc. Natl. Acad. Sci. USA 75:3608-3612 (1982)). 

In a specific embodiment, the nucleic acid to be introduced for 
purposes of gene therapy includes an inducible promoter operably linked to 

30 the coding region, such that expression of the nucleic acid is controllable 
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by controlling the presence or absence of the appropriate inducer of 
transcription. 

3. Prodrugs 

A method for treating tumors is provided. The method is practiced 
5 by administering a prodrug that is cleaved at a specific site by a CVSP1 6 
to release an active drug or a precursor that can be converted to active 
drug in vivo. Upon contact with a cell that expresses CVSP16 activity, the 
prodrug is converted into an active drug. The prodrug can be a conjugate 
that contains the active agent, such as an anti-tumor drug, such as a 

10 cytotoxic agent, or other therapeutic agent (TA), linked to a substrate for 
the targeted CVSP16, such that the drug or agent is inactive or unable to 
enter a cell, in the conjugate, but is activated upon cleavage. The prodrug, 
for example, can contain an oligopeptide, typically a relatively short, less 
than about 10 amino acids peptide, that is proteolytically cleaved by the 

15 targeted CVSP16. Cytotoxic agents, include, but are not limited to, 
alkylating agents, antiproliferative agents and tubulin binding agents. 
Others include, vinca drugs, mitomycins, bleomycins and taxanes. 
M. Animal models 

Transgenic animal models and animals, such as rodents, including 

20 mice and rats, cows, chickens, pigs, goats, sheep, monkeys, including 
gorillas, and other primates, are provided herein. In particular, transgenic 
non-human animals that contain heterologous nucleic acid encoding a 
CVSP16 polypeptide or a transgenic animal in which expression of the 
polypeptide has been altered, such as by replacing or modifying the 

25 promoter region or other regulatory region of the endogenous gene are 
provided. Such an animal can by produced by promoting recombination 
between endogenous nucleic acid and an exogenous CVSP16 gene that 
could be over-expressed or mis-expressed, such as by expression under a 
strong promoter, via homologous or other recombination event. 
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Transgenic animals can be produced by introducing the nucleic acid 
using any known method of delivery, including, but not limited to, 
microinjection, lipofection and other modes of gene delivery into a germline 
cell or somatic cells, such as an embryonic stem cell. Typically the nucleic 
5 acid is introduced into a cell, such as an embryonic stem cell (ES), followed 
by injecting the ES cells into a blastocyst, and implanting the blastocyst 
into a foster mother, which is followed by the birth of a transgenic animal. 

Generally, introduction of a heterologous nucleic acid molecule into a 
chromosome of the animal occurs by a recombination between the 

10 heterologous CVSP1 6-encoding nucleic acid and endogenous nucleic acid. 
The heterologous nucleic acid can be targeted to a specific chromosome. 

In some instances, knockout animals can be produced. Such an 
animal can be initially produced by promoting homologous recombination 
between a CVSP16 polypeptide gene in its chromosome and an exogenous 

15 CVSP16 polypeptide gene that has been rendered biologically inactive 
(typically by insertion of a heterologous sequence, e.g., an antibiotic 
resistance gene). In one embodiment, this homologous recombination is 
performed by transforming embryo-derived stem (ES) cells with a vector 
containing the insertionally inactivated CVSP16 polypeptide gene, such 

20 that homologous recombination occurs, followed by injecting the ES cells 
into a blastocyst, and implanting the blastocyst into a foster mother, 
followed by the birth of the chimeric animal ("knockout animal") in which a 
CVSP16 polypeptide gene has been inactivated (see Capecchi, Science 
244:1288-1292 (1989)). The chimeric animal can be bred to produce 

25 homozygous knockout animals, which can then be used to produce 

additional knockout animals. Knockout animals include, but are not limited 
to, mice, hamsters, sheep, pigs, cattle, and other non-human mammals. 
For example, a knockout mouse is produced. The resulting animals can 
serve as models of specific diseases, such as cancers, that exhibit under- 

30 expression of a CVSP16 polypeptide. Such knockout animals can be used 
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as animal models of such diseases e.g., to screen for or test molecules for 
the ability to treat or prevent such diseases or disorders. 

Other types of transgenic animals also can be produced, including 
those that over-express the CVSP1 6 polypeptide. Such animals include 
5 "knock-in" animals that are animals in which the normal gene is replaced 
by a variant, such as a mutant, an over-expressed form, or other form. For 
example, one species', such as a rodent's endogenous gene can be 
replaced by the gene from another species, such as from a human. 
Animals also can be produced by non-homologous recombination into other 
10 sites in a chromosome; including animals that have a plurality of integration 
events. 

After production of the first generation transgenic animal, a chimeric 
animal can be bred to produce additional animals with over-expressed or 
mis-expressed CVSP16 polypeptides. Such animals include, but are not 

15 limited to, mice, hamsters, sheep, pigs, cattle and other non-human 

mammals. The resulting animals can serve as models of specific diseases, 
such as cancers, that exhibit over-expression or mis-expression of a 
CVSP16 polypeptide. Such animals can be used as animal models of such 
diseases e.g., to screen for or test molecules for the ability to treat or 

20 prevent such diseases or disorders. In a specific embodiment, a mouse 
with over-expressed or mis-expressed CVSP16 polypeptide is produced. 

The following examples are included for illustrative purposes only 
and are not intended to limit the scope of the invention. 
25 EXAMPLE 1 

Identification of CVSP16 

The protein sequence of the protease domain of matriptase (MTSP1 ; 
accession number AF1 18224) was used to search the human HTGS (High 
Throughput Genomic Sequence) database using the blastn algorithm 
30 (www.ncbi.nlm.nih.gov/BLAST). This search and alignment algorithm 
compares a protein query sequence against a nucleotide sequence 
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database dynamically translated in all six reading frames (both strands). 
Among the proteases identified was the protease designated herein as 
CVSP16. The partial protein sequence of the CVSP16 protease domain in 
the database shares 34% identity to the protease domain of matriptase. A 
5 search using the algorithm blastp (www.ncbi.nlm.nih.gov/BLAST ) 

indicated that the translated sequence of CVSP16 showed 34% identity to 
prostamin (BAB20376.1 ), 36% identity to corin (NP_006578.1 ), 36% 
identity to marapsin (NP1 14154.1), 35% identity to prostasin 
(NP_002764.1) / 39% identity to transmembrane tryptase (NP036599.1 ) 
10 and 36% identity to serine protease 22 (NP071 402.1 ). Based on the 
incomplete and unordered human genome sequence 

(www.ncbi.nlm.nih.gov/genome/seq), CVSP1 6 appears to be localized on 
chromosome 16 (locus: 16p13.3; clone accession number AC009088.7). 
A search of sequences deposited in GenBank showed that one entry 

15 (XM 097026.1) corresponding to a hypothetical, genomic sequence- 
derived protein had homologous nucleotide sequence (74%) with CVSP16, 
although the reported translated protein sequence only had 10% homology 
to the partial CVSP16 polypeptide sequence. A search of the EST 
database showed the existence of two EST clones (AW450407 from 

20 human colon and AM 90509 from human fetal heart). Both EST clones 
shared 98-99% identical sequence to a portion of the CVSP1 6 protease 
domain sequence. 

Cloning of CVSP16 from human small intestine cell line using RACE 
reactions 

25 Using the EST-derived cDNA sequence homologous to CVSP16, four 

oligonucleotide primers hybridizing within the protease domain sequence 
were designed and synthesized. The sequence for the 5' end primer was 
5'-CCCTCTGGGTAGCCAGCACACAGCATC-3' SEQ ID No. 7 and that of 
the 3' end primer was 5'-GCCATCGTGGTGCCGGCCAACTACAG-3' SEQ 

30 ID No. 8. The sequence for the nested 5' end primer was 
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5'-GCACACAGCATCCCTGGCAATATCTGG-3' SEQ ID No. 9 and that of 
the nested 3'end primer was 5'-CGGCCAACTACAGCCAAGTGGAGCTG-3' 
SEQ ID No. 10. 

The first set of RACE primers were used to amplify cDNA fragments 
5 from human small intestine Marathon-ready cDNA library (catalog number 
7426-1; www.clontech.com). Following this, nested RACE reactions were 
performed using the nested primers. Several DNA bands were detected in 
all RACE reactions. The pool of cDNA fragments larger than 500 bp was 
isolated by 2% agarose electrophoresis and purified from the nested 5'- 

10 and 3'-RACE reactions using the MinElute gel extraction kit (catalog 

number 28606; www.qiagen.com), then subcloned into an E. coli vector 
(pCR2.1TOPO; catalog no. K-4500-01; www.invitrogen.com) and 
transformed into £. coli TOPI 0 cells (www.invitrogen.com). To identify 
clones that contained CVSP16 cDNA, colony hybridization was performed 

15 using a — 300-bp cDNA probe amplified from the nested 5'- and 3'-RACE 
primers on the same small intestine Marathon cDNA library. Subsequent 
sequence analysis confirmed that the nucleotide sequence of these 5 # - and 
3'-RACE-derived clones matched that of the CVSP16 sequence using a 
fluorescent dye-based DNA sequencing method (catalog number 4390244; 

20 ABI PRISM 0 BigDye™ Terminator v 3.0 Ready Reaction Cycle Sequencing 
Kits with AmpliTaq 0 DNA Polymerase, FS; home.appliedbiosystems.com). 
A methionine start codon was missing from the cDNA clone, but an in- 
frame stop codon could be found in the coding sequence; thus, the 5'- 
RACE product did not extend to the beginning of the coding sequence. 

25 PCR amplification of the full-length cDNA encoding CVSP16 

A search of the Incyte proprietary database showed a clone 
(228456.2) that had homology to the CVSP16 partial sequence. Based on 
this sequence, several primer sets were designed and synthesized. To 
obtain the full-length CVSP1 6 cDNA, 2 pairs of gene-specific primers that 

30 amplify 2 overlapping portions of the CVSP16 sequence and the cDNA 
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library made from human liver were used. The first pair of primers used 
was: 

5'-AIGGCCCGGCAGCTGCTCCTCCCCCTTGTG-3' SEQ ID No. 1 1 for the 
5' end (putative start codon underlined) and 
5 5'-CGGCTCCCGGGCAGGAAGTAGTGTTCCG-3' SEQ ID No. 1 2 for the 3' 
end. This primer pair amplified the initial half of the CVSP16 sequence. 
The second pair of primers used was: 5'- 

TGGGTCTTGGCACCTGCCAGCTGCTTTCTG-3' SEQ ID No. 15 at the 5' 
end and 

10 5'-GAAGGGGGAAGTGGTGCTGGGACCCTAG-3' SEQ ID No. 16 for the 3' 
end. This pair amplified the last half of the CVSP16 sequence and the 3'- 
end primer corresponds to the sequence downstream of the putative stop 
codon. Two cDNA fragments ( — 1.3 and —1.4 kbp) were amplified using 
these 2 sets of primers. The PCR products were isolated by 2% agarose 

15 electrophoresis, purified using the MinElute gel extraction kit 
(www.qiagen.com) and subcloned into pCR2.1TOPO 

(www.invitrogen.com). Sequence analysis was performed to confirm the 
nucleotide sequence. 

The full-length coding region of CVSP16 was prepared by stitching 

20 the two cDNA fragments using PCR. A —2.3 kbp fragment was amplified, 
isolated by 1 % agarose electrophoresis, purified, subcloned and sequenced 
as described above. The sequence obtained from this gene-specific 
amplification of CVSP16 matched those sequences obtained from both 5'- 
and 3'-RACE reactions. In addition the missing 5' end containing a start 

25 codon was present. 

Gene expression profile of CVSP16 in normal, tumor tissues and cell lin s 

To obtain information regarding the gene expression profile of the 
CVSP16 transcript, a -370-bp CVSP16 cDNA fragment was used to 
probe a dot blot composed of polyA + RNAs extracted and purified from 76 

30 different human tissues (Human Multiple Tissue Expression (MTE) Array; 
catalog no. 7775-1; www.clontech.com). The cDNA probe was amplified 
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from human small intestine Marathon cDNA library using the following 
primers: 5' end primer, 5'-CCCTCTGGGTAGCCAGCACACAGCATC-3' SEQ 
ID No. 17, and 3' end primer, 5'-GCCATCGTGGTGCCGGCCAACTACAG-3' 
SEQ ID No. 1 8. The results indicate that the CVSP1 6 transcript is 
5 strongly expressed in several tissues including kidney, stomach, colon, 
spleen, thyroid gland, trachea and pituitary gland. The CVSP16 transcript 
also is found in many other tissues albeit at a lower level. Among tumor 
cell lines, the CVSP16 transcript is found (in decreasing signal intensity) in 
cervical Hela S3, lung A549, leukemia K-562, Burkitt's lymphoma Raji, 

10 leukemia HL-60, colorectal SW480, Burkitt's lymphoma Daudi and 

leukemia MOLT-4. To compare the expression profile of CVSP16 transcript 
in a range of normal human and matched tumor tissues, a matched 
tumor/normal expression array (catalog number 7840-1; 
www.clontech.com) composed of 68 paired cDNA samples from individual 

15 patients was used. Results show that the CVSP16 transcript is expressed 
at a low level in a number of normal tissues including breast, prostate, 
cervix, uterus, colon, lung, small intestine, stomach, kidney and rectum, 
but is not differentially expressed in any of the matched tumors. 
Several SMART™ 5'-RACE cDNA libraries (catalog number K1 811-1; 

20 www.clontech.com) prepared from normal breast, normal testes, normal 
prostate, prostate cancer cell lines and breast cancer cell lines were 
analyzed for the presence of CVSP1 6 transcript by RT-PCR using gene- 
specific primers. The primer sequences were: 

5'-ATCGTGGTGCCGGCCAACTACAGCCAAGTG-3' SEQ ID No. 19 for the 
25 5' end primer and 5'-ACCCATCACCTGCTCCCGTATCCATGCCTC-3' SEQ 
ID No. 20 for the 3' end primer. The CVSP16 transcript was detected (in 
decreasing signal intensity) in normal breast, normal prostate, breast 
carcinoma cell line DU4475, prostate carcinoma cell line PC-3, prostate 
carcinoma cell line LNCaP, breast carcinoma cell line MDA-MB-231 , and 
30 breast carcinoma cell line MDA-MB-453. 
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Structural features 

The CVSP16 contains a signal peptide sequence (aa 1 to aa 23) and 
a trypsin-like serine protease domain designated herein as CVSP16 PD1 (aa 
46 to aa 286) characterized by the presence of a protease activation 
5 cleavage site (...R 46 i l 47 VGGSNAQP..., where I indicates protease 

activation cleavage site) at the beginning of the domain and the catalytic 
triad residues (H 87 , D 139 and S 243 ) in 3 highly-conserved regions of the 
catalytic domain. In addition, CVSP16 has an additional 465-amino acid 
sequence (aa 287 to aa 752) beyond the protease domain. Analysis of this 

10 465-amino acid long region indicates the presence of a second protease 
domain (aa 323 to aa 550, designated herein as CVSP16 PD2). In this 
domain, however, the invariant catalytic histidine is replaced by a serine 
(S 363 ) residue, and the highly conserved SGGP sequence that contains the 
catalytic serine has been replaced with the sequence S 510 RWS. The 

15 starting residue is unusual, suggesting that cleavage may not be needed for 
activation. These sequences and differences from other protease domains 
indicate that the second protease domain has lower catalytic activity. 
CVSP16 has 8 putative /V-linked glycosylation sites (...N 92 GT..., 

. . . N i 3 qYS . . . , . . . N 2 <| 7 LT. . . , . . . N 3 <| 7CT. . . , . . . N 3ggSS . . . f . . . N 4 q 2 /\S • • ■ # « • • ^421 . . . , 

20 ...N 508 DS...). The following cysteine pairings are noted: C 72 -C 88 , C 173 -C 24 9, 
^206*^228' C 239 -C 267 , C 348 -C 364 , C 444 -C 516 , C 472 -C 494 and C 506 -C 534 . In addition, 
an unpaired cysteine (C 159 ) in the first protease domain should pair with 
C 38 . An unpaired cysteine (C 430 ) in the second protease domain should pair 
with C 325 or, less likely, C 318 . PD1 and PD2 have an additional Cys (C 208 

25 and C 474 , respectively) that is unpaired or pairs with a Cys outside of each 
protease domain. The protein has a C-terminal domain beyond 550. 
Homology of CVSP16 to other serine proteases 

Clustal W alignment (using MacVector; version 6.5.3; www.accel- 
rys.com/products/macvector/index.html) of the derived CVSP16 full-length 

30 cDNA and protein sequences with those of the cDNA and protein 

sequences derived from the Incyte clone (228456.2) showed an 81% and 
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86% sequence identity, respectively. Alignment {bJastp; 
www.ncbi.nlm.nih.gov/BLAST) of the protease domain (minus the 465- 
amino acid extension at the C terminus) sequence of CVSP16 shows 40% 
identity with that of human matriptase/MTSP-1 (accession number 
5 NP 068813), 40% identity with that of human prostamin (accession 
number BAB20376.1), 40% identity with that of human marapsin 
(accession number NP 1 14154.1), 39% identity with that of human 
enterokinase (accession number NP 002763.1 ), 40% identity with that of 
human prostasin (accession number NP002764.1 ), 39% identity with that 
10 of human corin (accession number NP006587.1 ), 44% identity with that 
of human transmembrane tryptase (accession number NP 036599.1 ), and 
37% identity with that of human plasma kallikrein (accession number 
NP 000883.1). 

The CVSP16 and encoding nucleic acid has homology to a clones 

15 described in International PCT application No. WO 02/000860 (see SEQ ID 
No. 111 therein) and to clones in International PCT application No. WO 
02/046383, International PCT application No. WO 01075067 and EP 
1 1 30094. The clones and predicted encoded polypeptides described in 
the PCT applications and EP application, however, differ from the nucleic 

20 acid molecule encoding CVSP16 polypeptides and the CVSP16 

polypeptides provided herein. For example, each of the nucleic acid 
molecules described in International PCT application No. WO 02/00860 
includes a sequence of nucleotides encoding the sequence of amino acids 
set forth in SEQ ID 21 herein, and the disclosed polypeptides include the 

25 sequence of amino acids set forth in SEQ ID 21 herein. None of the 
polypeptides provided herein include at least 5, 10, 15, 20 or more 
contiguous amino acids from SEQ ID No. 21, particularly between Gin 660 
and Met 661 (SEQ ID No. 6) or between the corresponding amino acids in 
other CVSP16s. Hence the CVSP16s provided herein include the sequence 

30 Gln 660 Met 661 , particularly, the contiguous sequence Gly His Gin Met Thr Ser 
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(see, SEQ ID No. 6, amino acids 658-663). or Leu Pro Gin Gly His Gin Met 
Thr Ser Ala (see, SEQ ID No. 6, amino acids 655-664). 
Sequence analysis 

CVSP16 cDNA and protein sequences were analyzed using 
MacVector nucleic acid/protein sequence analysis program. The full length 
cDNA encoding CVSP16 is 2,293 bp long containing a 2,259-bp open 
reading frame, which translates to a 752-amino acid protein. The cDNA 
encoding the active protease domain is 717 bp long which translates to a 
239-amino acid domain. The G + C content of the CVSP16 cDNA is 68%. 
Attached are the cDNA sequence and the translated protein sequence of 
CVSP16 (see, also, SEQ ID Nos. 5 and 6). 
CVSP16 full length cDNA and translated protein sequence 
Sequence Range: 1 to 2293 

10 20 30 40 50 60 

ATGGCCCGGCAGCTGCTCCTCCCCCTTGTGGTGCTTGTCATCAGTCCCATCCCAGGAGCC 
TACCGGGCCGTCGACGAGGAGGGGGAACACCACGAACAGTAGTCAGGGTAGGGTCCTCGG 
MARQLLLPLVVLVI SPI PG A> 
70 80 90 100 110 120 

TTCCAGGACTCAGCTCTCAGTCCTACCCAGGAAGAACCTGAAGATCTGGACTGCGGGCGC 
AAGGTCCTGAGTCGAGAGTCAGGATGGGTCCTTCTTGGACTTCTAGACCTGACGCCCGCG 
FQDSALS PTQEEPEDLDCG R> 
130 140 150 160 170 180 

CCTGAGCCCTCGGCCCGCATCGTGGGGGGCTCAAACGCGCAGCCGGGCACCTGGCCTTGG 
GGACTCGGGAGCCGGGCGTAGCACCCCCCGAGTTTGCGCGTCGGCCCGTGGACCGGAACC 
PEPSARIVGGSNAQPGTWPW> 
190 200 210 220 230 240 

CAAGTGAGCCTGCACCATGGAGGTGGCCACATCTGCGGGGGCTCCCTCATCGCCCCCTCC 
GTTCACTCGGACGTGGTACCTCCACCGGTGTAGACGCCCCCGAGGGAGTAGCGGGGGAGG 
QVSLHHGGGHICGGSLIAPS> 
250 260 270 280 290 300 

TGGGTCCTCTCCGCCGCTCACTGTTTCATGACGAATGGGACGCTGGAGCCCGCGGCCGAG 
ACCCAGGAGAGGCGGCGAGTGACAAAGTACTGCTTACCCTGCGACCTCGGGCGCCGGCTC 
WVLSAAHCFMTNGTLEPAA E> 
310 320 330 340 350 360 

TGGTCGGTACTGCTGGGCGTGCACTCCCAGGACGGGCCCCTGGACGGCGCGCACACCCGC 
ACCAGCCATGACGACCCGCACGTGAGGGTCCTGCCCGGGGACCTGCCGCGCGTGTGGGCG 
WSVLLGVHSQDGPLDGAHTR> 
370 380 390 400 410 420 

GCAGTGGCCGCCATCGTGGTGCCGGCCAACTACAGCCAAGTGGAGCTGGGCGCCGACCTG 
CGTCACCGGCGGTAGCACCACGGCCGGTTGATGTCGGTTCACCTCGACCCGCGGCTGGAC 
AVAA IVVPANYSQVELGAD L> 
430 440 450 460 470 480 

GCCCTGCTGCGCCTGGCCTCACCCGCCAGCCTGGGCCCCGCCGTGTGGCCTGTCTGCCTG 
CGGGACGACGCGGACCGGAGTGGGCGGTCGGACCCGGGGCGGCACACCGGACAGACGGAC 
ALLRLAS PASLGPAVWPVC L> 
490 500 510 520 530 540 

CCCCGCGCCTCACACCGCTTCGTGCACGGCACCGCCTGCTGGGCCACCGGCTGGGGAGAC 
GGGGCGCGGAGTGTGGCGAAGCACGTGCCGTGGCGGACGACCCGGTGGCCGACCCCTCTG 
PRASHRFVHGTACWATGWG D> 
550 560 570 580 590 600 

GTCCAGGAGGCAGATCCTCTGCCTCTCCCCTGGGTGCTACAGGAAGTGGAGCTAAGGCTG 
CAGGT C CTC CGTCTAGGAGACGGAGAGGGGAC C C ACGATGTC CTTCAC CTCGATTCCGAC 
VQEADPLPLPWVLQEVELRL> 
610 620 630 640 650 660 

CTGGGCGAGGCCACCTGTCAATGTCTCTACAGCCAGCCCGGTCCCTTCAACCTCACTCTC 
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GACCCGCTCCGGTGGACAGTTACAGAGATGTCGGTCGGGCCAGGGAAGTTGGAGTGAGAG 
LGEATCQCLYSQPGPFNLT L> 
670 680 690 700 710 720 

CAGATATTGCCAGGGATGCTGTGTGCTGGCTACCCAGGGGGCCGCAGGGACACCTGCCAG 
GTCTATAACGGTCCCTACGACACACGACCGATGGGTCCCCCGGCGTCCCTGTGGACGGTC 
Q I LPGMLCAGYPGGRRDTC Q> 
730 740 750 760 770 780 

GGTGACTCTGGGGGGCCCCTGGTCTGTGAGGAAGGCGGCCGCTGGTTCCAGGCAGGAATC 
CCACTGAGACCCCCCGGGGACCAGACACTCCTTCCGCCGGCGACCAAGGTCCGTCCTTAG 
GDSGGPLVCEEGGRWFQAGI> 
790 800 810 820 830 840 

ACCAGCTTTGGCTTTGGCTGTGGACGGAGAAACCGCCCTGGAGTTTTCACTGCTGTGGCT 
TGGTCGAAACCGAAACCGACACCTGCCTCTTTGGCGGGACCTCAAAAGTGACGACACCGA 
TSFGFGCGRRNRPGVFTAV A> 
850 860 870 880 890 900 

ACCTATGAGGCATGGATACGGGAGCAGGTGATGGGTTCAGAGCCTGGGCCTGCCTTTCCC 
TGGATACTCCGTACCTATGCCCTCGTCCACTACCCAAGTCTCGGACCCGGACGGAAAGGG 
TYEAWIREQVMGSEPGPAF P> 
910 920 930 940 950 960 

AC C C AGCCC C AGAAGAC C C AGT C AGAT C C C CAGGAGC C C AGGGAGGAGAACTGC AC C ATT 
TGGGTCGGGGTCTTCTGGGTC AGT CT AGGGGT C CT CGGGT C C CTC CT CTTGACGTGGT AA 
TQPQKTQSDPQEPREENCTI> 
970 980 990 1000 1010 1020 

GCCCTGCCTGAGTGCGGGAAGGCCCCGCGGCCAGGGGCCTGGCCCTGGGAGGCCCAGGTG 
CGGGACGGACTCACGCCCTTCCGGGGCGCCGGTCCCCGGACCGGGACCCTCCGGGTCCAC 
ALPECGKAPRPGAWPWEAQ V> 
1030 1040 1050 1060 1070 1080 

ATGGTGCCAGGATCCAGACCCTGCCATGGGGCGCTGGTGTCTGAAAGCTGGGTCTTGGCA 
TACCACGGTCCTAGGTCTGGGACGGTACCCCGCGACCACAGACTTTCGACCCAGAACCGT 
MVPGSRPCHGALVSESWVLA> 
1090 1100 1110 1120 1130 1140 

CCTGCCAGCTGCTTTCTGGACCCGAACAGCTCCGACAGCCCACCCCGCGACCTCGACGCC 
GGACGGTCGACGAAAGACCTGGGCTTGTCGAGGCTGTCGGGTGGGGCGCTGGAGCTGCGG 
PASCFLDPNSSDS PPRDLD A> 
1150 1160 1170 1180 1190 1200 

TGGCGCGTGCTGCTGCCCTCGCACCCGCGCGCGGAGCGGGTGGCGCGCCTGGTGCAGCAC 
ACCGCGCACGACGACGGGAGCGTGGGCGCGCGCCTCGCCCACCGCGCGGACCACGTCGTG 
WRVLLiPSHPRAERVARLVQ H> 
1210 1220 1230 1240 1250 1260 

GAGAACGCTTCGTGGGACAACGCCCCGGACCTGGCGCTGCTGCAGCTGCGCACGCCCGTG 
CTCTTGCGAAGCACCCTGTTGCGGGGCCTGGACCGCGACGACGTCGACGCGTGCGGGCAC 
ENASWDNAPDLALLQLRTP V> 
1270 1280 1290 1300 1310 1320 

AACCTGAGTGCGGCTTCGCGGCCCGTGTGCCTACCCCACCCGGAACACTACTTCCTGCCC 
TTGGACTCACGCCGAAGCGCCGGGCACACGGATGGGGTGGGCCTTGTGATGAAGGACGGG 
NLSAASRPVCLPHPEHYFL P> 
1330 1340 1350 1360 1370 1380 

GGGAGCCGCTGCCGCCTGGCCCGCTGGGGCCGCGGGGAACCCGCGCTTGGCCCAGGCGCG 
CCCTCGGCGACGGCGGACCGGGCGACCCCGGCGCCCCTTGGGCGCGAACCGGGTCCGCGC 
GSRCRLARWGRGEPALGPG A> 
1390 1400 1410 1420 1430 1440 

CTGCTGGAGGCGGAGCTGTTAGGCGGCTGGTGGTGCCACTGCCTGTACGGCCGCCAGGGG 
GACGACCTCCGCCTCGACAATCCGCCGACCACCACGGTGACGGACATGCCGGCGGTCCCC 
LLEAELLGGWWCHCLYGRQ G> 
1450 1460 1470 1480 1490 1500 

GCGGCAGTACCGCTGCCCGGAGACCCGCCGCACGCGCTCTGCCCTGCCTACCAGGAAAAG 
CGCCGTCATGGCGACGGGCCTCTGGGCGGCGTGCGCGAGACGGGACGGATGGTCCTTTTC 
AAVPLPGDPPHALCPAYQE K> 
1510 1520 1530 1540 1550 1560 

GAGGAGGTGGGC AG CTGCTGGAATGACT C GCGTTGGAGC CTTTTGTGC C AGGAGGAGGGG 
CTCCTCCACCCGTCGACGACCTTACTGAGCGCAACCTCGGAAAACACGGTCCTCCTCCCC 
EEVGSCWNDSRWSLLCQEE G> 
1570 1580 1590 1600 1610 1620 

ACCTGGTTTCTGGCTGGAATCAGAGACTTTCCCAGTGGCTGTCTACGTCCCCGAGCCTTC 
TGGACCAAAGACCGACCTTAGTCTCTGAAAGGGTCACCGACAGATGCAGGGGCTCGGAAG 
TWFLAGI RDFPSGCLRPRA F> 
1630 1640 1650 1660 1670 16 80 

TTCCCTCTGCAGACTCATGGCCCATGGATCAGCCATGTGACTCGGGGAGCCTACCTGGAG 
AAGGGAGACGTCTGAGTACCGGGTACCTAGTCGGTACACTGAGCCCCTCGGATGGACCTC 
FPLQTHGPW I SHVTRGAYL E> 
1690 1700 1710 1720 1730 1740 

GAC C AGC T AGCCTGGGACTGGGGC C CTGATGGGGAGGAGACTGAGAC ACAGACTTGT C CC 
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CTGGTCGATCGGACCCTGACCCCGGGACTACCCCTCCTCTGACTCTGTGTCTGAACAGGG 
DQLAWDWGPDGEETETQTC P> 
1750 1760 1770 1780 1790 1800 

C CAC AC AC AGAG C ATGGTGCCTGTGGC CTG CGGCTGGAGGCTGCTC CAGTGGGGGTC CTG 
GGTGTGTGTCTCGTACCACGGACACCGGACGCCGACCTCCGACGAGGTCACCCCCAGGAC 
PHTEHGACGLRIiEAAPVGV l> 
1810 1820 1830 1840 1850 1860 

TGGCCCTGGCTGGCAGAGGTGCATGTGGCTGGTGATCGAGTCTGCACTGGGATCCTCCTG 
ACCGGGACCGACCGTCTCCACGTACACCGACCACTAGCTCAGACGTGACCCTAGGAGGAC 
WPWLAEVHVAGDRVCTG I L L> 
1870 1880 1890 1900 1910 1920 

GCCCCAGGCTGGGTCCTGGCAGCCACTCACTGTGTCCTCAGGCCAGGCTCTACAACAGTG 
CGGGGTCCGACCCAGGACCGTCGGTGAGTGACACAGGAGTCCGGTCCGAGATGTTGTCAC 
APGWVLAATHCVLRPGSTT V> 
1930 1940 1950 1960 1970 1980 

C CTTAC ATTGAAGTGTAT CTGGGC CGGG C AGGGGC C AGCT C CC TC C C AC AGGGC C AC C AG 
GGAATGTAACTTCACATAGACCCGGCCCGTCCCCGGTCGAGGGAGGGTGTCCCGGTGGTC 
PYI EVYLGRAGASSLPQGH Q> 
1990 2000 2010 2020 2030 2040 

ATGACCTCAGCACCGCCCCTCCTGTGCCAGATGACGGAAGGGTCCTGGATCCTCGTGGGC 
TACTGGAGTCGTGGCGGGGAGGACACGGTCTACTGCCTTCCCAGGACCTAGGAGCACCCG 
MTSAPPLLCQMTEGSWILV G> 
2050 2060 2070 2080 2090 2100 

ATGGCTGTTCAAGGGAGCCGGGAGCTGTTTGCTGCCATTGGTCCTGAAGAGGCCTGGATC 
TACCGACAAGTTCCCTCGGCCCTCGACAAACGACGGTAACCAGGACTTCTCCGGACCTAG 
MAVQGSRELFAAIGPEEAWI> 
2110 2120 2130 2140 2150 2160 

TCCCAGACAGTGGGAGAGGCCAACTTCCTGCCCCCCAGTGGCTCCCCACACTGGCCCACT 
AGGGTCTGTCACCCTCTCCGGTTGAAGGACGGGGGGTCACCGAGGGGTGTGACCGGGTGA 
SQTVGEANFLPPSGSPHWP T> 
2170 2180 2190 2200 2210 2220 

GGAGGCAGCAATCTCTGCCCCCCAGAACTGGCCAAGGCCTCGGGATCCCCGCATGCAGTC 
CCTCCGTCGTTAGAGACGGGGGGTCTTGACCGGTTCCGGAGCCCTAGGGGCGTACGTCAG 
GGSNLCPPELAKASGS pha v> 
2230 2240 2250 2260 2270 2280 

TACTTCCTGCTCCTGCTGACTCTCCTGATCCAGAGCTGAGGGGCTAGGGTCCCAGCACCA 
ATGAAGGACGAGGACGACTGAGAGGACTAGGT CT CGACT C C C CGATC C C AGGGTCGTGGT 
YFLLLLTLLIQS *> 
2290 
CTTCCCCCTTCTC 
GAAGGGGGAAGAG 

CVSP16 polypeptide sequence 
Sequence Range: 1 to 753 

10 20 30 40 50 60 

MARQLLLPL WL VI S P I PGAFQDSAL S PTQE E PEDLDCGRPE PS AR I VGGSNAQPGTWP W 
70 80 90 100 110 120 

QVS LHHGGGH I CGG S L I A P S WVL S AAH C FMTNGTL E P AAE W S VL LGVH S QDG P LDG AHTR 

130 140 150 160 170 180 

AVAAIWPANYSQVELGADLALLRLASPASLGPAVWPVCLPRASHRFVHGTACWATGWGD 

190 200 210 220 230 240 

VQEADPLPLPWVLQEVELRLLGEATCQCLYSQPGPFNLTLQILPGMLCAGYPGGRRDTCQ 

250 260 270 280 290 300 

GDSGGPDVCEEGGRWFQAG ITS FGFGCGRRNRPGVFTAVATYEAW I REQVMGSE PG PAF P 

310 320 330 340 350 360 

TQPQKTQSDPQEPREENCTIALPECGKAPRPGAWPWEAQV^PGSRPCHGALVSESWVLA 

370 380 390 400 410 420 

PASCFLDPNSSDSPPRDLDAWRVLLPSHPRAERVARLVQHENASWDNAPDLALLQLRTPV 

430 440 450 460 470 480 

NLSAASRPVCLPHPEHYFLPGSRCRLARWGRGEPALGPGALLEAELLGGWWCHCLYGRQG 

490 500 510 520 530 540 

AAVPL PGDPPHALCPAYQEKEEVGS CWNDSRWS LLCQEEGTWFLAG I RDF PS GCLR PRAF 

550 560 570 580 590 600 

FPLQTHGPWISHVTRGAYLEDQLAWDWGPDGEETETQTCPPHTEHGACGLRLEAAPVGVL 

610 620 630 640 650 660 

WPWLAEVHVAGDRVCTG I Ll^PGWVLAATHCVLRPGSTTVPY I EVYLGRAGAS SLPQGHQ 

670 680 690 700 710 720 

MTSAPPLLCQMTEGSWILVGMAVQGSRELFAAIGPEEAWISQTVGEANFLPPSGSPHWPT 

730 740 750 

GGSNLCPPELAKASGS PHA VYFLLLLTLL I QS* 
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EXAMPLE 2 
Expression of the protease domains 

Nucleic acid encoding each a full length CVSP16 and/or protease 
domain thereof can be cloned into a derivative of the Pichia pastoris 
5 vector pPIC9K (available from Invitrogen; see SEQ ID NO. 13) or pPIC9KX 
(described below), which is introduced into a suitable Pichia host or other 
compatible host and used to express the encoded CVSP1 6 or portion 
thereof. 

Plasmid pPIC9K features include the 5' AOX1 promoter fragment at 

10 1-948; 5' AOX1 primer site at 855-875; alpha-factor secretion signal(s) at 
949-1218; alpha-factor primer site at 1 152-1 172; multiple cloning site at 
1 192-1241; 3' AOX1 primer site at 1327-1347; 3' AOX1 transcription 
termination region at 1253-1586; HIS4 ORF at 4514-1980; kanamycin 
resistance gene at 5743-4928; 3' AOX1 fragment at 6122-6879; ColE1 

15 origin at 7961-7288; and the ampicillin resistance gene at 8966-8106. 
The plasmid pPIC9KX is derived from pPIC9K by eliminating the Xhol site 
in the kanamycin resistance gene to produce pPIC9KX. 

Other vectors that can be used for expression of CVSP1 6 or 
portions thereof include, but are not limited to, insect and mammalian 

20 vectors as described, for example, above. The protein also can be 

expressed in E. coli, for example, as inclusion bodies in the cytoplasm or 
in the cytoplasm using the strain Origami (i.e., Origami B from Novagen, 
Madison Wl) that permits folding in the cytoplasm. CVSP16 also can be 
expressed in the periplasmic space. 

25 EXAMPLE 3 

Assays for identification of candidate compounds that modulate the 
activity of a serine protease 

Assay for identifying inhibitors 

The ability of test compounds to act as inhibitors of catalytic 
30 activity of a catalytic activity of a CVSP16 polypeptide can be assessed in 
an amidolytic assay. Compound mediated inhibition of amidolytic activity 



-181- 



24745-1625 

of a CVSP1 6 polyeptide or a protease domain portion thereof, can be 
measured by IC50 values in such an assay. 

An exemplary assay buffer is HBSA (10 mM Hepes, 150mM 
sodium chloride, pH 7.4, 0.1 % bovine serum albumin). All reagents can 
5 be purchased from Sigma Chemical Co. (St. Louis, MO), unless otherwise 
indicated. Two IC50 assays at 30-minute (a 30-minute preincubation of 
test compound and enzyme) and at 0-minutes (no preincubation of test 
compound and enzyme) are conducted. For the IC50 assay at 30-minute, 
the following reagents are combined in appropriate wells of a Corning 

10 microtiter plate: 50 microliters of HBSA, 50 microliters of the test 

compound, diluted (covering a broad concentration range) in HBSA (or 
HBSA alone for the uninhibited velocity measurement), and 50 microliters 
of the SP or protease domain thereof diluted in buffer, yielding a final 
enzyme concentration of about 0.5-5 nM. Following a 30-minute 

15 incubation at ambient temperature, the assay is initiated by the addition 
of 50 microliters of a substrate for the particular SP (see, e.g. , table and 
discussion below), which was reconstituted in deionized water, and 
diluted in HBSA prior to the assay, yielding a final volume of 200 
microliters and a final substrate concentration of 200-600 //M. 

20 For an IC50 assay at 0-minute, the same reagents are combined: 

50 microliters of HBSA, 50 microliters of the test compound, diluted 
(covering the identical concentration range) in HBSA (or HBSA alone for 
uninhibited velocity measurement), and 50 microliters of the substrate, 
such as a chromogenic substrate. The assay is initiated by the addition of 

25 50 microliters of SP. The final concentrations of all components are 
identical in both IC50 assays (at 30- and 0-minute incubations). 

The initial velocity of the substrate hydrolysis is measured in both 
assays by, for example for a chromogenic substrate, the change in 
absorbance at a particular wavelength, using a Thermo Max® Kinetic 

30 Microplate Reader (Molecular Devices) over a 5 minute period, in which 
less than 5% of the added substrate was hydrolyzed. The concentration 
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of added inhibitor, which caused a 50% decrease in the initial rate of 
hydrolysis was defined as the respective IC50 value in each of the two 
assays (30-and 0-minute). 

Another assay for identifying inhibitors 
5 Test compounds for inhibition of the protease activity of the 

protease domain are assayed in Costar 96 well tissue culture plates 
(Corning NY). Approximately 0.5-5 nM of the CVSP16 or protease 
domain thereof is mixed with varying concentrations of inhibitor in 29.2 
mM Tris, pH 8.4, 29.2 mM imidazole, 217 mM NaCI (100 mL final 

10 volume) and allowed to incubate at room temperature for 30 minutes. 
About 200-600 jjM substrate is added, and the reaction monitored in a 
SpectraMAX® Plus microplate reader (Molecular Devices, Sunnyvale CA) 
by following the change in a parameter correlated with hydrolysis, such 
as absorbance for a chromogenic substrate for 1 hour at 37° C. 

15 Alternative assay for screening CVSP16 

The protease domain of CVSP1 6 or full-length polypeptide or other 
catalytically active portion thereof is expressed in Pichia pastoris. Test 
compounds are screened for modulation of the activity of the CVSP16 
polypeptide or portion thereof. Approximately 1-20 nM CVSP16 is mixed 

20 in Costar 96 well tissue culture plates (Corning NY) with varying 

concentrations of test compounds and/or known inhibitors or agonsists in 
29.2 mM Tris, pH 8.4, 29.2 mM Imidazole, 217 mM NaCI (100 //L final 
volume), and allowed to incubate at room temperature for 30 minutes. 
200-600 jjM of a chromogenic substrate is added, and the reaction is 

25 monitored in a SpectraMAX Plus microplate reader (Molecular Devices, 
Sunnyvale CA) by measuring the change in absorbance at 405 nm for 30 
minutes at 37 °C. 
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Identification of substrates 

Particular substrates for use in the assays can be identified 
empirically by testing substrates. The following list of substrates are 



exemplary of those that can be tested. 



Substrate name 


Structure 


S 2366 


pyroGlu-Pro-Arg-pNA.HCI 


spectrozyme t-PA 


Un 3 oU 2 -L)-rin I -oiy-Arg-plMA.ACUH 


N-p-tosyi-oiy-rro-Arg-pNA 


N-p-tosyl-Gly-Pro-Arg-pNA 


benzoyi-vai-uiy-Arg-pNA 


Benzoyl-Val-Gly-Arg-pNA 


Pefachrome t-PA 


Ln 3 oU 2 -U-nn I -oiy-Arg-pIMA 


o Z/DO 


i\i -o- Z.-U- A rg-oiy- a rg-pi\j a. Zr\K^\ \ 


O OA A A 


pyrooiu-Ljiy-Arg-pNA.nUi 


C OOQQ 

O z/oo 


M-u^ne-rro-Arg-piMA.znui 


spectrozyme UK 


UDO-L-lK/^lUia-t-DUUJ-oly-Arg-pNA.zACUn i 


o zoUz 


n-u-rro~rne-Arg-pi>iA.znLri 




l_l P\ \/~| 1 a, , A rn n M A OUPI 1 

n-u- vai-Leu-Arg-pr>JA .zmui 


o O O O O 

o ZZZZ 


D-» II*-* O D\ l\ / A rt M A LI 1 

DZ-iie-oiu(g-UK)-oiy-Arg-pNA.nUi 
R = H(50%) and R = CHJ50%) 


Chromozyme PK 


Benzoyl-Pro-Phe-Arg-pNA 


S 2238 


H-D-Phe-Pip-Arg-pNA.2HCI 


S 2251 


H-D-Val-Leu-Lys-pNA.2HCI 


Spectrozyme PI 


H-D-Nle-HHT-Lys-pNA.2AcOH 




Pyr-Arg-Thr-Lys-Arg-AMC 




H-Arg-Gln-Arg-Arg-AMC 




Boc-Gln-Gly-Arg-AMC 




Z-Arg-Arg-AMC 


Spectrozyme THE 


H-D-HHT-Ala-Arg-pNA.2AcOH 


Spectrozyme fXlla 


H-D-CHT-Gly-Arg-pNA.2AcOH 




CVS 2081-6 (MeS0 2 -dPhe-Pro-Arg-pNA) 




Pefachrome fVlla (CH 3 S0 2 -D-CHA-But-Arg-pNA) || 



pNA = para-nitranilide (chromogenic) 

AMC = amino methyl coumarin (fluorescent) 



If none of the above substrates are cleaved, a coupled assay can 
be used. Briefly, such assays test the ability of the protease to activate 
an enzyme, such as plasminogen and trypsinogen. To perform these 
assays, the single chain protease is incubated with a zymogen, such as 
plasminogen or trypsinogen, in the presence of a known substrate for 
plasmin or trypsin, such as a Spectrozyme substrate. If a single chain 
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CVSP1 6 activates the zymogen, the activated enzyme, such as plasmin 
and trypsin, will degrade the substrate therefor. 

EXAMPLE 4 

Other Assays 

5 These assays are described with reference to MTSP1 , but such 

assays can be readily adapted for use with CVSP16. 

Amidolytic Assay for Determining Inhibition of Serine 
Protease Activity of Matriptase or MTSP1 

The ability of test compounds to act as inhibitors of rMAP catalytic 

10 activity was assessed by determining the inhibitor-induced inhibition of 
amidolytic activity by the MAP, as measured by IC 50 values. The assay 
buffer was HBSA (10 mM Hepes, 1 50mM sodium chloride, pH 7.4, 0.1% 
bovine serum albumin). All reagents were from Sigma Chemical Co. (St. 
Louis, MO), unless otherwise indicated. 

15 Two IC 50 assays (a) one at either 30-minutes or 60-minutes (a 30- 

minute or a 60-minute preincubation of test compound and enzyme) and 
(b) one at 0-minutes (no preincubation of test compound and enzyme) 
were conducted. For the IC 50 assay at either 30-minutes or 60-minutes, 
the following reagents were combined in appropriate wells of a Corning 

20 microtiter plate: 50 microliters of HBSA, 50 microliters of the test 

compound, diluted (covering a broad concentration range) in HBSA (or 
HBSA alone for uninhibited velocity measurement), and 50 microliters of 
the rMAP (Corvas International) diluted in buffer, yielding a final enzyme 
concentration of 250 pM as determined by active site titration. Following 

25 either a 30-minute or a 60-minute incubation at ambient temperature, the 
assay was initiated by the addition of 50 microliters of the substrate S- 
2765 (N-a-Benzyloxycarbonyl-D-arginyl-L-glycyl-L-arginine-p-nitroaniline 
dihydrochloride; DiaPharma Group, Inc.; Franklin, OH) to each well, 
yielding a final assay volume of 200 microliters and a final substrate 

30 concentration of 100//M (about 4-times K m ). Before addition to the assay 
mixture, S-2765 was reconstituted in deionized water and diluted in 
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HBSA. For the IC 50 assay at 0 minutes; the same reagents were 
combined: 50 microliters of HBSA, 50 microliters of the test compound, 
diluted (covering the identical concentration range) in HBSA (or HBSA 
alone for uninhibited velocity measurement), and 50 microliters of the 
5 substrate S-2765. The assay was initiated by the addition of 50 

microliters of rMAP. The final concentrations of all components were 
identical in both IC 50 assays (at 30- or 60- and 0-minute). 

The initial velocity of chromogenic substrate hydrolysis was 
measured in both assays by the change of absorbance at 405 nM using a 

10 Thermo Max® Kinetic Microplate Reader (Molecular Devices) over a 5 
minute period, in which less than 5% of the added substrate was used. 
The concentration of added inhibitor, which caused a 50% decrease in 
the initial rate of hydrolysis was defined as the respective IC 50 value in 
each of the two assays (30- or 60-minutes and 0-minute). 

15 In vitro enzyme assays for specificity determination 

The ability of compounds to act as a selective inhibitor of 
matriptase activity was assessed by determining the concentration of test 
compound that inhibits the activity of matriptase by 50%, (IC 50 ) as 
described in the above Example, and comparing IC 50 value for matriptase 

20 to that determined for all or some of the following serine proteases: 
thrombin, recombinant tissue plasminogen activator (rt-PA), plasmin, 
activated protein C, chymotrypsin and factor Xa. 

The buffer used for all assays was HBSA (10 mM HEPES, pH 7.5, 
150 mM sodium chloride, 0.1% bovine serum albumin). The assay for 

25 IC 50 determinations was conducted by combining in appropriate wells of a 
Corning microtiter plate, 50 microliters of HBSA, 50 microliters of the test 
compound at a specified concentration (covering a broad concentration 
range) diluted in HBSA (or HBSA alone for V 0 (uninhibited velocity) 
measurement), and 50 microliters of the enzyme diluted in HBSA. 

30 Following a 30 minute incubation at ambient temperature, 50 microliters 
of the substrate at the concentrations specified below were added to the 
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wells, yielding a final total volume of 200 microliters. The initial velocity 
of chromogenic substrate hydrolysis was measured by the change in 
absorbance at 405 nm using a Thermo Max® Kinetic Microplate Reader 
over a 5 minute period in which less than 5% of the added substrate was 
5 used. The concentration of added inhibitor which caused a 50% 

decrease in the initial rate of hydrolysis was defined as the IC 50 value. 
Thrombin (flla) Assay 
Enzyme activity was determined using the chromogenic substrate, 
Pefachrome t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glycyl-L-Arginine-p- 

10 nitroaniline, obtained from Pentapharm Ltd.). The substrate was 

reconstituted in deionized water prior to use. Purified human a-thrombin 
was obtained from Enzyme Research Laboratories, Inc. The buffer used 
for all assays was HBSA (10 mM HEPES, pH 7.5, 150 mM sodium 
chloride, 0.1% bovine serum albumin). 

15 IC 50 determinations were conducted where HBSA (50 jjL), 

a-thrombin (50 fj\) (the final enzyme concentration is 0.5 nM) and inhibitor 
(50 (covering a broad concentration range), were combined in 
appropriate wells and incubated for 0 or 30 minutes at room temperature 
prior to the addition of substrate Pefachrome-t-PA (50 //I) (the final 

20 substrate concentration is 250 jjM, about 5 times Km). The initial 

velocity of Pefachrome t-PA hydrolysis was measured by the change in 
absorbance at 405 nm using a Thermo Max® Kinetic Microplate Reader 
over a 5 minute period in which less than 5% of the added substrate was 
used. The concentration of added inhibitor which caused a 50% 

25 decrease in the initial rate of hydrolysis was defined as the IC 50 value. 
Factor Xa 

Factor Xa catalytic activity was determined using the chromogenic 
substrate S-2765 (N-benzyloxycarbonyl-D-arginine-L-glycine-L-arginine-p- 
nitroaniline), obtained from DiaPharma Group (Franklin, OH). All 
30 substrates were reconstituted in deionized water prior to use. The final 
concentration of S-2765 was 250 jl/M (about 5-times Km). Purified 
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human Factor X was obtained from Enzyme Research Laboratories, Inc. 
(South Bend, IN) and Factor Xa (FXa) was activated and prepared from it 
as described (Bock, P.E., Craig, P.A., Olson, S.T., and Singh, P. Arch. 
Biochem. Biophys. 273:375-388 (1989)) The enzyme was diluted into 
5 HBSA prior to the assay were the final concentration was 0.25 nM. 
Recombinant tissue plasminogen activator (rt-PA) Assay 
rt-PA catalytic activity was determined using the substrate, 
Pefachrome t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glycyl-L-arginine-p-nitro- 
aniline, obtained from Pentapharm Ltd.). The substrate was made up in 

10 deionized water followed by dilution in HBSA prior to the assay where the 
final concentration was 500 micromolar (about 3-times Km). Human 
rt-PA (Activase®) was obtained from Genentech Inc. The enzyme was 
reconstituted in deionized water and diluted into HBSA prior to the assay 
where the final concentration was 1 .0 nM. 

15 Plasmin Assay 

Plasmin catalytic activity was determined using the chromogenic 
substrate, S-2366 [L-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline 
hydrochloride], which was obtained from DiaPharma group. The 
substrate was made up in deionized water followed by dilution in HBSA 

20 prior to the assay in which the final concentration was 300 micromolar 

(about 2.5-times Km). Purified human plasmin was obtained from Enzyme 
Research Laboratories, Inc. The enzyme was diluted into HBSA prior to 
the assay where final concentration was 1 .0 nM. 

Activated Protein C (aPC) Assay 

25 aPC catalytic activity was determined using the chromogenic 

substrate, Pefachrome PC (delta-carbobenzloxy-D-lysine-L-prolyl-L- 
arginine-p-nitroaniline dihydrochloride), obtained from Pentapharm Ltd.). 
The substrate was made up in deionized water followed by dilution in 
HBSA prior to the assay where the final concentration was 400 

30 micromolar (about 3-times Km). Purified human aPC was obtained from 
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Hematologic Technologies, Inc. The enzyme was diluted into HBSA prior 
to the assay where the final concentration was 1 .0 nM. 
Chymotrypsin Assay 
Chymotrypsin catalytic activity was determined using the 
chromogenic substrate, S-2586 (methoxy-succinyl-L-arginine-L-prolyl-L- 
tyrosyl-p-nitroanilide), which was obtained from DiaPharma Group. The 
substrate was made up in deionized water followed by dilution in HBSA 
prior to the assay where the final concentration was 100 micromolar 
(about 9-times Km). Purified (3X-crystallized; CDI) bovine pancreatic 
alpha-chymotrypsin was obtained from Worthington Biochemical Corp. 
The enzyme was reconstituted in deionized water and diluted into HBSA 
prior to the assay where the final concentration was 0.5 nM. 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
claims. 
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